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Analog World 


"Everything is going digital". Cell phones, television, video disks, 
hearing aids, motor controls, audio amplifiers, toys, printers, what have you. 

Analog design is obsolete, or will be shortly. Or so most people 
think. 

Imminent death has been predicted for analog since the advent of the 
PC. But it is still here; in fact, analog ICs have been growing at almost 
exactly the same rate as digital ones. A digital video disk player has more 
analog content than the (analog) VCR ever did. 

The explanation is rather simple: the world is fundamentally analog. 
Hearing is analog. Vision, taste, touch, smell, analog all. So is lifting and 
walking. Generators, motors, loud-speakers, microphones, solenoids, 
batteries, antennas, lamps, LEDs, laser diodes, sensors are fundamentally 
analog components. 

The digital revolution is constructed on top of an analog reality. 
This fact simply won't go away. Somewhere, somehow you have to get into 
and out of the digital system and connect to the real world. 

Unfortunately, the predominance and glamour of digital has done 
harm to analog. Too few analog designers are being educated, creating a 
void. This leaves decisions affecting analog performance to engineers with 
a primarily digital background. 

In integrated circuits, the relentless pressure toward faster digital 
speed has resulted in ever-decreasing supply voltages, which are anathema 
to high-performance analog design. At 350nm (3.3V) there is still enough 
headroom for a high-performance analog design, though 5 Volts would be 
better. At 180nm (1.8V) the job becomes elaborate and time-consuming 
and performance starts to suffer. At 120nm (1.2V) analog design becomes 
very difficult even with reduced performance. At 90nm, analog design is 
all but impossible. 

There are "mixed signal" processes which purportedly allow 
digital and analog circuitry on the same chip. A 180nm process, for 
example, will have some devices which can work with a higher supply 
voltage (e.g. 3 Volts). While such an addition is welcome (if marginal), the 
design data (i.e. models) are often inadequate and oriented toward digital 
design. 
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Hence this book. It should give you an overview of the world of 
analog IC design, so that you can decide what kind of analog function can 
and cannot, should and should not be integrated. What should be on the 
same chip with digital and what should be separate. And, equally 
important, this book should enable you to ask the right questions of the 
foundry, so that your design works. The first time. 


ok ok ok 


You will find that almost all analog ICs contain a number of 
recognizable circuit elements, functional blocks with just a few transistors. 
These elements have proven useful and thus re-appear in design after 
design. Thus it makes sense to first look at such things as current mirrors, 
compound transistors, differential stages, cascodes, active loads, Darlington 
connections or current sources in some detail and then examine how they 
are best put together to form whole functions. 


* ok *k 


Academic text books on IC design are often filled with mathematics. 
It is important to understand the fundamentals, but it is a waste of time to 
calculate every detail of a design. Let the simulator do this chore, it can do 
it better and faster than any human being. An analysis will tell you within 
seconds if you are on the right track and how well your circuit performs. 
Assuming that you have competent models and a capable simulator, an 
analysis can teach you more about devices and circuits than words and 
diagrams on a page. 
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1 Devices 


Let's assume your IC design needs an operational amplifier. Which 
one? If you check the data-books of linear IC suppliers, you'll find 
hundreds of them. Some have low current consumption, but are slow. 
Others are quite complex, but feature rail-to-rail inputs and outputs. There 
are inputs which are factory-trimmed for low offset voltages, outputs for 
high currents, designs for a single supply voltage, very fast devices, etc. 

Here is the inherent problem with analog building blocks: there are 
no ideal designs, just configurations which can be optimized for a particular 
application. If you envisioned a library from which you can pull various 
analog building blocks and insert them into your design, you are about to 
experience a rude shock: this library would have to be very large, 
containing just about every operational amplifier (and all other linear 
functions) listed in the various data-books. If it doesn't, your IC design is 
bound to be inferior to one done with individual ICs. 

In short: There are no standard analog cells. If your application is 
the least bit demanding, you find yourself either modifying previously used 
blocks or designing new ones. In either case you need to work on the 
device level, connecting together transistors, resistors and rather small 
capacitors. 

To do this you need to know what devices are available and what 
their limitations are. But above all you need to understand devices in some 
detail. The easiest way to learn about complex technical things is to follow 
their discovery, to have the knowledge gained by the earlier men and 
women (who pioneered the field) unfold in the same way they brought it to 
light. 
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Semiconductors 


In 1874 Ferdinand Braun was a 24-year old teacher in Leipzig, 
Germany. He published a paper which was nothing short of revolutionary: 
he had found that some materials violated Ohm's law. Using naturally 
formed crystals of Galena (lead sulfite, the chief ore mineral of lead) and 
other sulfites, he pressed a spring-loaded metal tip against their surfaces and 
observed that the current through this arrangement was dependent on the 
polarity of the applied voltage. Even more puzzling was the fact that, in the 
direction which had better conduction, the resistance decreased as the 
current was increased. 

What Braun (who later would give us the CRT) had discovered, we 
now know as the diode, or rectifier. It was not a very good one, there was 
only a 30% difference between forward and reverse current. And there 
were no practical applications. Braun could not explain the effect, nor 
could anybody else. 

In 1879 Edwin Hall of Johns Hopkins University discovered what 
was later named the Hall Effect: when you pass a magnetic field through a 
piece of metal it deflects the current running through the metal. In all the 
metals he tried the deflection was to one side; he was greatly relieved to see 
that this confirmed the negative charge on the electron. 

But then the surprise came. In some materials the deflection went 
the other way. Where there perhaps positive electrons? 

Nothing much happened until about 1904. Radio appeared on the 
scene and needed a "detector". The signal was amplitude modulated and to 
make the music or speech audible the radio frequency needed to be rectified 
(i.e. averaged). Thus, 30 years after Braun's discovery, the "odd behavior" 
of a wire touching Galena (and now many other materials, such as silicon 
carbide, tellurium and silicon) found a practical application. The device 
was called the "Cat's whisker", but it actually didn't work very well; one 
had to try several spots on the crystal until one was found which produced a 
loud enough signal. 

And it was replaced almost immediately by the vacuum tube, which 
could not only rectify but amplify as well. Thus the semiconductor rectifier 
(or diode) went out of fashion. 

It was not until 1927 that another practical application appeared: 
large-area rectifiers. These were messy, bulky contraptions using copper- 
oxide (and later selenium) to produce DC from line voltage, chiefly to 
charge car batteries. But there was still no understanding of how these 
devices worked. 
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In the background, mostly at universities and large corporate 
laboratories, some research went on, despite the fact that there was no 
semiconductor industry yet. In 1931 A.H. Wilson came up with a complete 
model of energy bands: electrons exist only at discrete levels, each with a 
higher energy than the lower one; only two electrons can exist at the same 
level, but they have opposite spins; at the last (or highest level) are the 
valence electrons and there is a gap in energy to the ultimate one, the 
conduction band; once they reached that last level, conduction happens by 
accelerating the electrons in an electric field. 

The theory was fine, but it took 15 years for someone to make a 
connection between it and the diode. 

There were two problems masking the real semiconductor effects. 
First, all the behaviors so far noticed were surface effects. The cat's whisker 
applied a metal wire, the copper-oxide and selenium rectifier metal plates. 
Today this is recognized as a rather specialized configuration, only 
surviving in the Schottky diode. Second, the semiconductor material was 
anything but pure, containing elements and molecules which counteracted 
the desired behavior. 

Then World-War II happened and with it came radar. To get 
adequate resolution, radar needed to operate at high frequencies. Vacuum 
tubes were too slow, so the discarded "cat's whisker" came into focus again 
(employed right after the antenna to rectify the wave so it could be mixed 
with a local oscillator and produce a lower frequency, which could be 
handled by vacuum tubes). 

This time a world-wide emergency drove the effort, with plenty of 
funding for several teams. They started with the "cat's whisker" and tried 
to determine what made it so fickle and unreliable. It became immediately 
obvious that purer material was required, and that this material should be in 
the form of a single crystal. When they heated part of a crystal close to the 
melting point and moved the heated zone, the foreign materials moved with 
it. And now they realized that some of these impurities were actually 
required to get the diode effect. And these impurities all fell into very 
specific places within the periodic table of elements. 

Silicon and germanium both have a valence of four. Valence 
simply means that in the outermost layer of electron orbits there are four 
electrons. Silicon, for example, is element number 14, meaning it has a 
total of 14 electrons. The first orbit (or energy level) has two electrons, the 
second eight and the third four. 

The outermost orbits of the atoms touch each other and the electrons 
in this orbit don't stay with one particular atom, they move from orbit to 
orbit. It is this sharing of electrons that hold the atoms together. And this 
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ability to move from atom to atom is also the basis of electrical conduction: 
in conductors the electrons roam widely and are easily enticed to move in 
an electrical field, whereas in an insulator they stay close to home. 

Electrically, pure silicon is a terribly uninteresting material. It is an 
insulator, but not a very good one. The fun begins when we add the right 
impurities, or dopants. 

Just to the right of silicon in the periodic table is phosphorus, 
element number 15. Like Silicon, it has two electrons in the first orbit, 
eight in the second but there are five in the third. Now let's say we were 
able to pluck out an atom in a block of silicon and replace it with a 
phosphorus atom. Four of the valence electrons of this new atom will 
circulate with the silicon electrons, but the fifth one won't fit in. This 
excess electron creates a negative charge and the silicon becomes what we 
now call n-type. 

This introduction of excess electrons is unlike static charge. When 
you brush your hair so that it stands upright, you have simply moved some 
electrons temporarily. When you "dope" silicon, the charge is permanent, 
fixed in the crystal lattice (and does not become a battery). 

Similarly, to the left of Silicon and one space up in the periodic table 
is boron, element number 5. It has two electrons in a first level and three in 
a second, a valence of three. If we replace a silicon atom with a boron one, 
there is an electron missing and we create a positive charge, or p-type 
material. As with the excess electron in n-type silicon, we can apply an 
electric field and cause a current to flow, but the net-effect is the flow of 
holes, not electrons. This is what makes the Hall effect go the wrong way. 

It is important to understand this mechanism of moving holes and 
electrons in doped semiconductors. In n-type material an excess 
phosphorus electron wanders into the path of a neighboring silicon electron 
and displaces it. The displaced electron then takes the orbit of another one 
and so on until the last electron ends up at the starting point, the phosphorus 
atom. 

This endless game of musical chairs - proceeding at near the speed 
of light - depends greatly on the temperature. At absolute zero there is no 
movement. At about -60°C the movement is sufficient for semiconductor 
effect to start in silicon. At about 200°C there is so much movement that 
silicon practically becomes a conductor. It is only within a relatively 
narrow range, about -55°C to 150°C, that silicon is a useful semiconductor. 

In p-type material the movement starts with an electron in the 
neighborhood of the boron atom. It fills the vacancy and then is itself 
replaced by another electron and so on until the first electron moves away 
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from the boron atom again. The moving is done by electrons, but the net 
effect is a moving hole. 

When an electric field is present the movement takes on a direction: 
electrons flow toward the positive electrode and are replaced by other 
electrons flowing out of the negative electrode. 

It is amazing how few dopants it takes to make n-type or p-type 
material. Silicon has 5x10” atoms per cubic centimeter. A doping level 
can easily be as low as 5x10°° boron or phosphorus atoms per cubic 
centimeter, i.e. one dopant atom for every 10 million silicon atoms. No 
wonder it took so long to discover the true nature of the semiconductor 
effects; in nature, the number of miscellaneous impurities is far larger than 
one in 10 million. 


The Diode 


Even with a dopant present silicon is uninteresting. It is not a good 
conductor and as a resistor it is inferior to metal film or even carbon. But if 
we have both n-type and p-type atoms in the same silicon crystal, things 
suddenly happen. 

Opposite charges attract each other, so the excess electrons near the 
border of the n-type section move into the p-type material and stay there. 
An electron fills a hole and the electric charges cancel each other. 

This only happens over a short distance, as far as an electron (or 
hole) can roam. The resulting region is called the space-charge layer or 
depletion region. 

Now suppose you 

P N connect a voltage to the 
two terminals. If the p- 
region is connected to the 
negative terminal of the 
supply and the n-region to 
the positive one, you 
simply push the charges 
I away from each other, 
Depletion Region enlarging the depletion 
region. 

If, however, the p- 
region is positive and the 
n-region negative, you push the charges closer together as the voltage 
increases. The closer proximity forces more and electrons and holes to 


Fig. 1-1: A depletion region forms between p- 
doped and n-doped semiconductor areas. 
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cross the depletion region. The effect is exponential: at 0.3 Volts (at room 
temperature) very little current flows; at 0.6 Volts the current is substantial 
and at 0.9 Volts very large. 
The expression for the diode voltage is: 
Vd-q 


ie ge or nen kT “ 


where Vd = voltage across the diode 

k = Boltzman constant (1.38E-23 Joules/Kelvin) 

T = the absolute temperature in Kelvin 

q = the electron charge (1.6E-19 Coulombs) 

I] = the actual current through the diode 
and Is = diffusion current 

Note that 1.38E-23 is a more convenient notation for 1.38x10~°. 

The diffusion current Is depends on the doping level of n-type and 
p-type impurities, the area of the diode and (to a very high degree) on 
temperature. A reasonable starting point for a small-geometry IC diode is 
Is=1E-16. 

The equations neglect a few things. There is a limit in the voltage 
that can be applied in the reverse direction. Similar to an arc-over in any 
insulator, there comes a point when the electric field becomes too large and 
the opposing charges crash into each other. This breakdown voltage 
depends on the concentration of dopants: the higher the concentration, the 
lower the breakdown voltage. 

There is a price to be paid for high breakdown voltage. As the 
dopant concentration is lowered, the depletion layer becomes larger and the 
higher voltage pushes it deeper yet. This distance must be accommodated 
in the design. 

The opposing charges in a semiconductor junction are no different 
from those on the plates of a capacitor. So every junction has a capacitance; 
but since the distance between the electrons and holes changes with applied 
voltage, the capacitance becomes voltage dependent. The lower the 
voltage, the higher the capacitance, increasing right into the forward 
direction. 

Lastly, there is resistance in the semiconductor material not taken up 
by the depletion region. For our "typical" concentration of 5E15 (atoms per 
cubic centimeter, giving a practical breakdown voltage in an IC of about 25 
Volts), the resistivity is about 1 Ohm-cm for phosphorus (n-type) and 3 
Ohm-cm for boron (p-type). For comparison, aluminum has a resistivity of 
2.8 microOhm-cm, copper 1.7 microOhm-cm. Resistivity (p or rho) is 
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measured between opposite surfaces of a cube of material with a side-length 
(w, h, 1) of lcm (10mm): 


R*w*h hm cm* 
a2 ~ _ Ohm*cm*cm _ Ohm*cm (or Ohm-cm) 
cm 


The (Bipolar) Transistor 


At the time of the first serious work on the semiconductor diode, 
Bell Laboratories in New Jersey was already world-famous. It attracted the 
brightest scientists and, even among those, Bill Shockley was a stand-out. 
In 1938 Shockley teamed up with Walter Brattain to investigate 
semiconductors. 

The depletion layer intrigued Shockley. There was a faint similarity 
to the vacuum diode. It occurred to Shockley that, if he could somehow 
insert a grid into this region, it might be possible to control the amount of 
current flowing in a copper-oxide rectifier, creating the solid-state 
equivalent of the vacuum triode. Shockley went to Brattain with the idea 
and Brattain was amused. The same idea had occurred to him too; he had 
even calculated the dimensions for such a grid, which turned out to be 
impractically small. Shockley tried it anyway and couldn't make it work. 
Brattain had been right. 

Shockley was not a man easily defeated, though. He modified his 
idea and came up with a different principle of operation. He conceived that, 
since a relatively small number of electrons or holes are responsible for 
conduction in semiconductors and they each carry a charge, he could place 
a metal electrode near the surface, connect it to a voltage and thus either 
pull these carriers toward the surface or push them away from it. Therefore, 
he thought, the conduction of the region nearest the surface could be altered 
at will. He tried it -- and it didn't work either. The idea was identical to 
today's MOS transistor. 

The work stopped there; both Shockley and Brattain were assigned 
to other projects during the war. But in 1945 Shockley was made co- 
supervisor of a solid-state physics group which included Brattain. Shockley 
was 35, Brattain 43. The progress made in refining silicon and germanium 
was not lost on Shockley; he decided to try his idea for an amplifying 
device again and had a thin film of silicon deposited, topped with an 
insulated control electrode. It still didn't work; no matter what voltage was 
applied to the control electrode, there was no discernable change in current 
through the silicon film. Shockley was puzzled; according to his 
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calculations there should have been a large change. But the effect - if there 
was any - was at least 1500 times smaller than theoretically predicted. 

It was at this time, that John Bardeen, 37, joined Shockley's group. 
He looked at Shockley's failed experiment and mulled it over in his head for 
a few months. In March 1946 he came up with an explanation: it was the 
surface of the silicon which killed the effect. Where the silicon stops, the 
four valence electrons are no longer neatly tied up by the neighboring 
atoms. Bardeen correctly perceived that some of them were left dangling 
and thus produced a surface charge (or voltage), which blocked any voltage 
applied to an external control electrode. 

With this theoretical breakthrough the group now decided to change 
directions; instead of attempting to make a device, they investigated the 
fundamentals of semiconductor surfaces. It was a long, painstaking 
investigation; it took more than a year. On November 17, 1947 Robert B. 
Gibney, another member of the group and a physical chemist, suggested 
using an electrolyte to counteract the surface charge. On November 20 he 
and Brattain wrote a patent disclosure for an amplifying device as tried by 
Shockley but using electrolyte on the surface. Then they went to the lab and 
made one. The electrolyte was extracted from an electrolytic capacitor with 
a hammer and nail. The device worked, the electrolyte did precisely the job 
that Gibney thought it would. 

But, although this "field effect" device amplified, it was very slow, 
amplifying nothing faster than about 8Hz. Brattain and Bardeen suspected 
that it was the electrolyte that slowed down the device so, on December 16, 
1947, they tried a different approach: a gold spot with a small hole in the 
center was evaporated onto germanium, on top of the insulating oxide. The 
idea was to place a sharp point-contact in the center without touching the 
gold ring, so that the point would make contact with the germanium, while 
the insulated gold ring would shield the surface. And now, for the first 
time, they got amplification. 

There was only one thing wrong with this device: it didn't work as 
expected. A positive voltage at the control terminal increased the current 
through the device when, according to their theory, it should have decreased 
it. Bardeen and Brattain investigated and found they had inadvertently 
washed off the oxide before evaporating the gold, so that the gold was in 
contact with the germanium. What they were observing was an entirely 
different effect, an injection of carriers by the point contact. They realized 
that, to make such a device efficient, the distance between the two contacts 
at the surface needed to be very small. They evaporated a new gold spot, 
split it in half with a razorblade and placed two point contacts on top. Now 
the device worked even better and they demonstrated it to the Bell 
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management on December 23, 1947. 

For half a year Bell kept the breakthrough a secret. Bardeen and 
Brattain published a paper on June 25. 1948 and on June 30 a press 
conference was held in New York. The announcement made little 
impression; the New York Times devoted a few lines to it on page 46. 

Shockley had been disappointed by the turn of events, he had not 
been part of the final breakthrough. But he realized that, even though there 
was a working device, the battle wasn't over yet. No-one within the group 
really understood precisely how the transistor worked. So, in the early days 
of January 1948 Shockley sat down and tried to figure out what was going 
on between the two point contacts. And in the process he conceived a much 
better structure: the junction transistor. 

It was a brilliant analysis and holds up to this day. In a bipolar 
transistor there is a current flowing between the base and emitter terminals, 
which is a diode. Thus electrons flow from the emitter to the base (so 
named because in the original point-contact transistor it was the bulk of the 
material). Since the base is p-doped, these electrons are the minority 
carriers in the base (hence the name bipolar transistor - carriers of both 
polarities are needed for the effect). A few of them will reach the base 
terminal. But if the base is 
lightly doped and very thin 
most of them will be 
attracted by the positive 
collector voltage before 
they re-combine with a 
hole in the base. In a good 
transistor 100 or even 500 
of the electrons will be 


side-tracked to the 
Fig 1-2: The electrons in the base of an NPN collector while one goes to 
transistor are intended to flow to the base terminal : 
but, if the base is very thin, most of them are the base terminal. Thus we 
diverted by the positive potential of the collector. 


+0.65V__[] Base 


Gr ound 


have a current gain of 100 or even 500. 
The bipolar transistor is an odd 
amplifier, quite non-linear and somewhat 
difficult to use. Considerthe input = a ‘; Gain (FE) = 18 
terminal, the base. It is a diode (with . 
respect to the emitter). You need to lift its 
voltage up to at least 0.6 Volts (at room 
temperature) for any current to flow. 


From that point on the current increases Fig. 1-3: The current flow and gain 
of an NPN transistor. 


nar 
’ 
Collector Current (Ic) 
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exponentially, both in the base and the collector. It is not a linear voltage 
amplifier; only the currents have a (more or less) linear relationship. 

Also notice that the emitter current is always larger than that of the 
collector, since it contains both the collector and base current. 

We have shown here an NPN transistor. If we reverse all the doping 
and the voltages we create a PNP transistor. It works the same way in every 
respect except that it is a bit handicapped: it is slower and has a lower gain; 
holes, now the minority carriers in the base, just don't move as well as 
electrons. 

The point-contact transistor was a nightmare to manufacture and had 
very poor reliability. Also, these devices were made from germanium, 
which has a rather limited useful temperature range. The junction 
transistors were made by alloying dopant materials on either side of a flat 
piece of germanium or silicon. It was difficult to make the base uniformly 
thin and the process created considerable leakage current. 

The next big step was again invented at Bell Labs: diffusion. At 
room temperature gases mix even if they are held perfectly still. This 
happens because each atom or molecule moves around randomly due to the 
energy it receives by temperature. The higher the temperature, the more 
pronounced is this movement and thus the mixing or diffusion. If the 
temperature is high enough (e.g. over 1000°C) such gases can even diffuse 
into solid material, though their diffusion speed decreases enormously. 
Thus, for example, silicon exposed in a high-temperature furnace to n-type 
impurity (gas) atoms develops an n-layer at its surface with a depth as far as 
the impurities penetrate. This may require a temperature close to the 
melting point of silicon and take several hours for a penetration of just a 
few micro-meters, but it is far more controlled than alloying. 

Moreover, you can dope repeatedly. Suppose you have a piece of 
silicon which has been doped n-type. If you diffuse p-type impurities into 
the surface, you convert a layer from n-type to p-type if there are more p- 
type impurities than n-type. The junction is located at the depth at which 
the two impurities are equal in concentration. A second diffusion of a yet 
higher concentration can then convert the material back to n-type again. 
However, you have to pay attention to the fact that subsequent exposure to 
high temperature causes any previous layer to diffuse further. 

There are a few more dopants available too: p-type gallium (rarely 
used) and n-type arsenic and antimony. The latter two have the advantage 
that they diffuse more slowly than phosphorus or boron. For this reason 
they are primarily used early in the process and are thus less affected by 
subsequent diffusions. 

When, in 1956, the three inventors of the transistor were awarded 
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the Nobel Prize for physics, only Walter Brattain was still at Bell 
Laboratories. John Bardeen had left in 1951 to become a professor at the 
University of Illinois and, for his research there in superconductivity, he 
received a second Nobel Prize in 1972. 

Bill Shockley left Bell Labs in 1954. Banking on his reputation, 
which had risen proportionally to the acceptance of the transistor, he 
managed to strike a deal with the Beckman Instruments Company. A 
subsidiary, called the Shockley Semiconductor Laboratories, was set up in 
Palo Alto, California. Shockley's fame had risen to such a height that he 
could pick some of the best people. Within a year he had some 20 people - 
predominantly Ph.D.s - working for him, among them Robert Noyce, 28, 
Gordon Moore, 27, and Jean Hoerni, 32. 

For all of these people there was a brief period of fascination after 
they joined. But then the true Bill Shockley appeared from behind the glitter 
of fame and they discovered that Shockley was, in fact, a rather erratic and 
unpleasant man. He would fire his employees for minor mistakes, throw 
tantrums over trivial problems and change directions for no apparent 
reasons. He incessantly tried innovative management techniques, such as 
posting everybody's salaries on the bulletin board. 

Noyce and Moore were pushing Shockley to make silicon transistors 
using the diffusion approach. Shockley wasn't interested; his hope was for 
his laboratory to come up with an entirely new device, a device which 
would represent as large a step over the transistor as the transistor had been 
over the vacuum tube. 

Now totally dissatisfied, the crew talked to Arnold Beckman, the 
president of the parent company, and informed him of the impossible 
situation. Beckman promised to hire a business-minded individual who 
could act as buffer between Shockley and his staff. But the solution didn't 
work, Shockley refused to let go of the day-to-day decision-making. Out of 
patience, eight staff members reached a deal with the Fairchild Camera and 
Instrument Company and, in October 1957, the group departed. 

The new company, called Fairchild Semiconductor, was at first an 
independent operation, with Fairchild Camera and Instrument holding an 
option for a buy-out. The product they began to develop was the one they 
had proposed to Shockley. The detailed structure of this device, called the 
Mesa transistor, had been tried in germanium before, but not in silicon. It 
required two diffusions, both into the same side of a silicon wafer. The first 
diffusion was p-type, the second n-type, and the difference in depth between 
the two layers created the base region which, for the first time, could be 
made with a high degree of accuracy. The top surface of the transistor was 
then masked with wax and the exposed silicon etched away, giving the 
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remaining piece a mesa-like shape. 

Because of its superior performance, sales of the Mesa transistor 
took off almost immediately, reaching $ 7 million in 1959. But there were 
also problems. The most serious one concerned the reliability of the Mesa 
transistor. The etched silicon chip was soldered onto the bottom of a small 
metal case, leads were attached to the top regions and then the case was 
welded shut. Tiny metal particles, ejected during the welding process, 
floated around inside the case and kept on shorting out the exposed p-n 
junctions. 

Silicon rapidly grows a thin oxide layer when it is exposed to air. 
This is better known as glass (silicon-dioxide) and its growth can be 
enhanced by moisture at high temperature. Some of the dopant gases used 
in diffusion (such as gallium) can penetrate this oxide layer, while others 
are stopped by it. There was, therefore, a possibility that the oxide layer 
could be used as a mask. If the oxide were to be etched off in some places 
but not in others and suitable dopant gases used, diffusion would take place 
only in the areas without oxide. But a study done at Bell Laboratories came 
to the conclusion that an oxide layer exposed to a diffusion is left 
contaminated and must subsequently be replaced by a freshly grown one. 

This bothered Hoerni. He didn't see any reason why the oxide layer 
could not be used as a diffusion mask for both diffusions -- provided he 
would use dopant gases which were stopped by the oxide - and why the 
oxide should subsequently be regarded as contaminated. So he tried it -- as 
an unofficial side project -- and out of the trial came an advance ranking in 
importance second only to the transistor itself: the planar process. 

In preparation for the first diffusion Hoerni spread a photosensitive 
and etch-resistant coating (photoresist) over the top of the oxide and 
exposed it through a photographic plate (mask) carrying the patterns of the 
base regions, using the photographic techniques already developed for 
"printed" circuits. The subsequent etching then only removed the oxide in 
the regions where p-type impurities were to be diffused. After the diffusion 
he closed these oxide "windows" again by placing the wafer in high- 
temperature moisture and then repeated the steps for the second (emitter) 
diffusion. In a third masking step windows could then be etched in the 
oxide to make contact to the two diffused layers. He then evaporated 
aluminum onto the top surface of the wafer and patterned it with the same 
photographic techniques. The wafer could then be scribed (like glass) and 
broken into individual transistor chips. 

The planar process had a whole series of advantages. Of most 
immediate importance was the fact that the junction was automatically 
protected by the oxide, one of the best insulators known. No longer could 
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the metal particles from the welding of the case short it out. Secondly, 
photographic methods could be used to delineate not just one but hundreds 
of transistors simultaneously. Thus individual, delicate masking of each 
transistor was no longer required, giving the planar transistor a huge 
potential for reduced cost. Noyce, who was by now the general manager, 
saw the advantage of the planar process and quietly moved it into 
production. 

There was another advantage to the planar transistor: once the 
dopant enters the silicon it diffuses in all directions, including sideways. 
The P-N junction, therefore, ends up underneath the oxide, never exposed to 
either human handling or the contamination of air. For this reason the planar 
junction is the cleanest (and most stable) junction ever produced. Fairchild's 
customers who, in early 1959, didn't know that their transistors were now 
being manufactured by an entirely new process, were surprised to find 
leakage currents one thousand times smaller than those of previous 
shipments. 

While Fairchild flourished, Shockley Transistor went downhill. It 
was sold twice, then closed in 1969. Shockley became interested in 
sociology and announced a theory called "dysgenics", which proposed that 
poor people were doomed to have low IQs. By the time he died in 1989 his 
reputation was ruined. 


The Integrated Circuit 


In July 1958 Jack Kilby of Texas Instruments conceived that a block 
of germanium or silicon could be host to not only transistors and diodes, but 
resistors and (junction) capacitors as well. This appeared to be enough of a 
variety to make a small circuit, all of it in the same block of silicon. 

The idea was good, but his approach cumbersome. To insulate the 
various components from each other Kilby etched the silicon, in some areas 
all the way through. To connect them together he used gold wires. The 
circuit was very small to be sure, but it was a production nightmare. Each 
tiny block of silicon had to be made individually, including the patterning, 
etching and wiring. When TI's attorneys prepared a patent application they 
looked in horror at the Rube Goldberg-like drawings and had Kilby put in 
some words saying that interconnection could also be made by laying down 
a layer of gold. How this could be done over this three-dimensional 
landscape he didn't say. 

While Kilby was working on his circuits in Texas, a similar but far 
more elegant idea occurred to Robert Noyce in California. Noyce's 
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motivation was primarily cost, not size. He realized that it didn't make 
sense to fabricate precisely arranged transistors on a wafer, cut them apart, 
place them in a housing and arrange them again in on a circuit board; if the 
additional components on the circuit board could be placed on the wafer, a 
considerable number of manufacturing steps could be saved. Noyce had no 
problem visualizing capacitors and resistors made in silicon, he was 
constantly dealing with these (unwanted) effects. What was needed, 
though, was an inexpensive way to connect all these components on the 
wafer. The idea of using wires had no chance in Noyce's mind, it would 
have simply been too expensive. But he saw that, in the planar process, this 
problem was already solved: the aluminum layer used to connect the 
transistors and the wires could also be used between the components. 

In1959 Noyce entered his idea into his notebook and filed for a 
patent application. Kilby's and Noyce's patent applications were clearly in 
interference and a bitter battle between the two companies started in the 
courts. Texas Instruments won because Kilby application mentioned a thin 
film of gold, thus seemingly anticipating Noyce. Fairchild appealed. 

While the two patents were fought over in the courts, neither TI nor 
Fairchild could collect any royalties for integrated circuit, which were 
already showing explosive growth. So the two companies came to an 
agreement, declaring Kilby and Noyce co-inventors of the integrated circuit. 
Shortly after this the appeals court handed down its decision: Noyce, not 
Kilby, was declared the inventor of the IC. 

It could not have been otherwise. Even today every single IC is 
made exactly as Noyce described it, while Kilby's approach has long been 
abandoned. But the most important contributor to the invention of the IC 
was Clearly Jean Hoerni with his planar process, for which he has never 
been adequately recognized. The planar process rates as one of the great 
inventions of the 20th century. 

Robert Noyce died in 1990 at age 62. In 2000 Jack Kilby won the 
Nobel Prize for the invention of the integrated circuit 


Let's take a closer look at a basic processing step in the Planar 
process. First, you need a mask, a piece of flat glass, with an opaque 
pattern on it. The pattern has been generated optically or, more likely, with 
an electron beam. 

The silicon wafer is first oxidized, i.e. a thin SiO2 layer is grown, for 
example by exposing the wafer to steam in a furnace. Instead of oxide, 
nitride or a combination of oxide and nitride is also used. On top of the 
oxide a thin layer of "photoresist" is spread, a light-sensitive emulsion 
similar to that on a photograph. 
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Fig. 1-4: The first step: A light-sensitive and 
etch-resistant layer (photoresist) is spread on 
the wafer and exposed to light through the 

mask. 
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Light is then projected through the mask onto the wafer. The higher 


Fig. 1-5: The photoresist is developed like a 


photograph and the wafer is ready for etching. 


Fig. 1-6: The oxide is etched away and the 
photoresist is removed. 


Fig. 1-7: A gas containing N-type dopants 
(boron, arsenic or antimony) diffuses slowly into 
the surface of the wafer at high temperature. 


Edition February 2005 


/  Photoresist rcs | 


the frequency of the light, the 
greater the detail, so ultra-violet 
light or even x-rays are used. 

The photoresist is then 
developed and the portions not 
exposed to light are washed off. 
(There are both positive and 
negative photoresists; you have 
the choice of removing the areas 
which are either exposed or not 
exposed to light). 

Next the entire wafer is 
immersed in an acid which 
removes the oxide in the areas 
where it is not protected by the 
photoresist. In more modern 
processes a plasma is used; acid 
etches not only downward but 
also slightly sideways 
underneath the photoresist, while 
plasma etches downward only. 

The wafer is then placed 
into a furnace (a quartz tube 
heated to greater than 1000°C). 
A gas carrying the desired 
dopant (in this case boron, 
arsenic or antimony) swirls 
around the wafer and slowly 
diffuses into the surface. 

Note two important facts 
here: 1. There is a crowding of 
dopants near the surface of the 
silicon. With time they will 
diffuse deeper into the silicon, 
but there will always be more 
dopants near the surface. Thus 
any diffused region has a marked 
gradient. 2. Dopants not only 
diffuse downward, but also side- 
ways. (Since supply is more 
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limited at the very edge, the side-ways diffusion extends to only about half 
the distance of the downward one). This places the junction (where n = p) 
underneath the oxide and is thus never exposed to the (dirty) environment. 

After diffusion the exposed silicon surface is covered again by an 
oxide layer so that the wafer is ready for the next masking step, which could 

be another diffusion or the 
etching of contact holes. 
There is an important 
feature here, which should not 
go unnoticed. SiO; is glass, 
Fig. 1-8: After the diffusion the oxide is re- which is transparent to light. 
grown, ready for the next masking step. The light is reflected at the 
bottom of the oxide by the 
silicon and interference patterns are created, i.e. the sum of direct and the 
reflected light eliminates some frequencies. Thus the color of the oxide 
layer depends on its thickness. This not only makes for beautiful 
photographs but, more importantly, it allows subsequent masks to be 
precisely aligned with previous ones. 

Here then is one form of an NPN transistor made with the planar 
process. The substrate (the starting wafer) is doped p-type as the silicon is 
grown. There are three diffusions in succession, the first being rather deep. 
After the diffusions, contact holes are made (with the same basic photoresist 
process), aluminum is deposited over the entire wafer, patterned (another 
photoresist step) and etched away where it is not wanted. 

Alas, this transistor 
has a rather significant 
shortcoming: high collector 
resistance. The current has 
to flow through the region 
between the base and the 
substrate. That is the far end 
of the collector diffusion, the 
end which has the fewest 

Fig. 1-9: A simple planar NPN transistor. dopant atoms and therefore 

the highest resistance. 

Since the invention of the planar process a few more ways of 
fabricating have been added: 

Epitaxy. If you strip a silicon wafer of its oxide and put it into a 
furnace which is filled with gas containing not only a dopant but also 
silicon, you can grow a doped single-crystal layer. As the atoms carried by 
the gas deposit themselves on the surface of the wafer, they will align 


Emitter Base Collector 
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themselves according to the existing crystal structure. 

You can also precede this by diffusing regions into the original 
wafer, so that you will have areas of high concentration underneath the 
epitaxial layer. Even though these regions are buried, it is still possible to 
align subsequent diffusions to them. When a diffused area is re-oxidized, a 
small amount of silicon is consumed (the Si in SiO2), thus creating a small 
depression in the surface. The edges of these depressions are visible at the 
top surface of the epitaxial layer, though the image tends to be blurry and is 
shifted (in most processes) along the crystal axis (around 45°). 

Ion Implantation. You can literally shoot dopant atoms into silicon 
by electrically charging (ionizing) them and then accelerating them with a 
high voltage (several hundred thousand volts). The treatment is somewhat 
brutal, the newly arrived atoms don't end up neatly aligned in the crystal 
structure and an annealing heat cycle is necessary to let the atoms align 
themselves into a crystal structure. 

The number of dopant atoms introduced is generally more accurate 
in ion implantation than in diffusion. Also you can aim implantation for a 
certain depth (but not very deep). In the subsequent heat cycle (and during 
subsequent diffusions) the dopant atoms will diffuse and thus widen the 
layer. The maximum concentration, however, is then not at the surface, but 
at a chosen depth. 

We now have arrived at a modern NPN transistor as made in a 
bipolar (or BICMOS) process. Before growing the epitaxial layer, a heavily 
doped (thus N+) buried layer is diffused (or ion implanted) into the p-type 
substrate. During epitaxy it diffuses somewhat, both into the substrate and 
the new epitaxial layer. 

The next 

Emitter Base Collector Substrate (-V) diffusion is the 
isolation. It is deep 
(and, therefore, also 
wide); it has to connect 
up with the substrate, 
so that the entire n-type 
collector region is 
surrounded by p-type 


Fig. 1-10: A much improved planar, integrated NPN : 
transistor. The buried layer and sinker lowers the regions. A second n- 
collector resistance. type diffusion connects 


up with the buried layer 
(and the emitter N+ diffusion is used on top of it simply because it's 
available at no cost). Now the collector current has a (fairly) low-resistance 
path. 
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This transistor is isolated from its neighbors (and other components) 
as long as the substrate is held at the most negative voltage in the circuit 
(junction isolation). In this way the collector-substrate junction is always 
reverse-biased and only leakage current (pico-amperes) flows. 

There are some flaws and limitations in the performance of this or 
any other bipolar transistor: 

Early Effect, named after Jim Early (then at Bell Labs, later at 
Fairchild), who explained it first. Ideally the collector current should be 
equal to the base current multiplied by a constant gain (hFE or beta). But, 
as we have seen above, each p-n junction has two depletion layers. For the 
collector-base junction, one depletion layer extends into the collector, the 
other into the base. The base is almost always more heavily doped than the 
collector, so its depletion layer 


is fairly shallow. However, the 
base is also very thin, so even a é0 
shallow depletion layer takes 18 Early Effect 
up a significant portion of the i 


Ic / pA 


base depth. As the collector 
voltage increases, the depletion 
layers widen. In the collector 
region this has little effect (as 


long as it doesn't hit the other iL ! f t f 

side of the collector), but in the chi jae 
base region it narrows the 

base-width. Since the gain of a Fig. 1-11: Even with a constant base 
bipolar transistor is very much current the collector current increases with 


the collector-emitter voltage because the 


dependent on the base-width, the depletion layer narrows the base-width. 


gain simply increases as the 
effective base-width decreases. 

If you draw a straight line, extending the slope (from 0.4 to 5 Volts) 
into the negative quadrant and let it intersect with the zero-current line, you 
get the Early Voltage. In this case, for a 5-Volt process, the Early voltage 
is -15 Volts (but is generally expressed as 15V). Depending on the chosen 
base-width, it can be less than that and the slope correspondingly steeper. 


Gain versus Current. For any bipolar transistor the current gain falls off 
both at low and high current. 

First, the low end. There is always a leakage current across any 
junction; for a perfectly clean surface this is the diffusion current. In the 
base-emitter junction this leakage current takes away a portion of the 
supplied base current. In our graph here the current shunted by leakage at 
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the low end (10nA Ie, or about 50pA Ib) amounts to 33% of Ib, i.e. the gain 
has dropped by one third. 

If you extend this plot to much lower current, you will see the gain 
rise to almost infinity. This is nothing more than the effect of the collector- 
base leakage current. 

At the high end two effects take place simultaneously: 1. The 
number of electrons present in the base simply becomes so large that they 
are no longer the minority carriers and the whole effect comes to a halt. 2. 
The base current must flow 
from the contact to the flat 
area between the emitter and 30 
collector. At low current this 
is no problem, the resistance 
in the base is sufficiently 
small. But as the collector 
current increases (and with it 
the base current), the 
resistance in this flat region P 
of the base causes a 
significant voltage drop, and 
the far end gets less current. 
Eventually, as the current is 
increased even more, only the 
edge of the emitter on the side of 
the base contact is active. Thus 
the high-current capability of a bipolar transistor is determined not by the 
emitter area, but by the 
active emitter length, i.e. 
emitter periphery to which 
the base can supply current 
through low resistance. A 
good starting point for the 
maximum current (at which 
the gain drops to 50%) is 
1.5mA per um of active 
emitter length, but this value 
varies from process to 
process. 

To increase the 
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Fig. 1-12: The current gain (hFE) of a 
bipolar transistor drops off both at low and 
high currents. 


Collector Contact 


Emitter 
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Fig. 1-13: Minimum-geometry NPN transistor on 
the left and higher-current design on the right. 
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place base contacts on both sides of the emitter and lengthen the emitter. 
Shown here on the left is the top view of a minimum-geometry transistor 
and on the right a version for higher current. 

To make the life of a designer easier, the isolation pattern is usually 
drawn as a rectangle and then inverted when making the mask, i.e. the 
isolation diffusion is actually between devices, not in the device area. 

Many processes require that all contacts be the same size, in which 
case the contact rectangles must be broken up into small, identical (and 
properly spaced) squares. 

Be aware, that transistors of different sizes (as drawn here) do not 
match well. At low current a large emitter area produces a higher gain than 
a small one, because the minority carriers have a higher chance to be 
captured by the collector. If you want to produce a precise ratio, use only 
one emitter size and identical base contacts. The emitters can be in a 
common base area and the collector size is of no consequence except for 
collector resistance (or saturation voltage). 

Substrate Current. There is only leakage current across the 
collector-substrate junction, unless the transistor saturates. 

Assume the collector is connected through a resistor to the positive 
supply voltage and the base is driven so hard that the collector voltage drops 
to near the potential of the emitter (termed 


saturation). eee 

There are now two diodes in parallel and 
the base current has two paths; the new one formsa |. NPN #— 
PNP transistor with the NPN base becoming the 2 
emitter, the NPN collector the base and the NN Lv 
substrate the collector. Since the NPN collector is _ 
much larger than its emitter, some (or all) of the 
base current flows to the substrate. 

There is little danger in this, except when ee ee 
you drive the base very hard, trying to get the Fig. 1-14: When an NPN 
lowest possible collector voltage, or if you have transistor saturates a 


stray PNP device leaks 


many saturating NPN transistors. The path in the SuirchiiG ihe cubenaic: 


substrate from a transistor to the -V connection has 
some resistance. If the substrate current is so large 
that the voltage drop across this resistance can forward-bias some substrate- 
collector junction on the way, you may get some really bad effects, 
including latch-up. 

Maximum Voltage. To get a high operating voltage requires high 
resistivity - low doping concentration. But there is a price to be paid: the 
depletion regions become wide. 
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Let's use the integrated NPN transistor as an example. There are 
two depletion regions, one extending into the epitaxial layer from the base 
(downward and side-ways), the other into the epitaxial layer from the 
isolation. To make sure the first one does not reach the substrate (and thus 
cause premature 
breakdown or punch- 
through), the epitaxial 
layer must be deep - 
which means that the 
isolation diffusion must 
be deep, and thus wide. 

Fig. 1-15: At higher operating voltage the depletion Look at the left 
regions around the NPN transistor become larger. side of the transistor. 
The spacing between the 
isolation (as drawn) and the base must accommodate the following: 
the side-ways diffusion of the isolation, 
the isolation-collector depletion region, 
a safety margin for possible misalignment, 
the collector-base depletion region and 
the side-ways diffusion of the base. 
In addition there is also a high-voltage depletion layer each between the 
base and the sinker and the sinker and the isolation, as well as a deeper (and 
thus wider) sinker. All this adds up to a painfully large area. 

The increase in area can be curbed somewhat by two measures: 1. 
Use an additional diffusion for the isolation by creating a P+ region directly 
underneath the normal one before growing the epitaxial. The two halves 
will then diffuse toward each other (up-down diffusion) and meet in the 
middle, thus requiring only half the depth and width; 2. Add more 
processing steps, creating both low-voltage and high-voltage devices on the 
same wafer. 

The Miller Capacitance. As we have seen above, the bipolar 
transistor is a very non-linear (exponential) voltage amplifier and cannot 
thus normally be used as one. But it has a voltage gain, and a high one at 
that (several hundred is not uncommon). 

There is an unavoidable junction capacitance between the collector 
and the base. If you feed a current with an ac signal into the base, the 
voltage change on the collector will be much larger than that on the base. 
Thus, looking into the base, the junction capacitance appears multiplied by 
the voltage gain (the Miller effect). Instead of a tiny fraction of a pico- 
Farad you have to deal with 10 or even 100pF. If the base is fed from a 
high impedance (e.g. a current source), the frequency response is then 


Emitter Base Collector Substrate (-V)} 


[mmm 
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nowhere near the advertised f, (cutoff frequency). 

This Miller effect can be reduced by circuit design techniques (e.g. a 
Cascode Stage), but even so most circuits cannot operate much above say 
1/20 of f;. (f, is the frequency at which the current gain drops to 1). 

On the other hand, there is also a benefit. In feedback amplifiers 
you almost always need a compensation capacitor (more of this later). 
Using the Miller effect you can get away with a 5pF capacitor, which 
appears to be as large as InF, a value which would be much too large to be 
integrated. 


The Case of the Lateral PNP Transistor 


It is the world's worst transistor, you couldn't sell it as a discrete 
component: low cutoff frequency, very limited current range and an 
inferior noise figure. But no self-respecting analog IC designer would want 
to be without it. The reason: In either a CMOS or bipolar process no 
additional diffusions are required. 

The emitter and collector are 
formed by the base diffusion (in a 
bipolar process) or the p-channel 
source/drain diffusion (in a CMOS 
process). The current thus flows 
radially (or laterally) along the surface 
from the emitter to the collector. 

The doping levels are all 
wrong. For optimum performance you 
would want the emitter to have a very 
high concentration, the base somewhat 
lower and the collector quite low (to 
accommodate the higher collector 
voltage). Here the emitter and 
collector doping levels are equal, and 
the base is much higher. Thus, to 
allow space for the depletion regions, 
the base width (the distance between the collector and emitter, minus the 
side-ways diffusions) needs to be quite large. Hence the slow speed (it 
takes time for the carriers to travel across the base). Figure on an f, in the 
neighborhood of 30MHz with an operating voltage of 15 Volts; at lower 
voltages the base-width can be narrower which increases f, but also makes 
the Early effect more pronounced. 


Fig. 1-16: The lateral PNP transistor. 
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Despite of all of this, with good surface control you can get a gain 
in excess of 100. But the current range is limited, rarely exceeding 100uA 
for a minimum geometry device. 

And there is somewhat of a problem with substrate current. There is 
a competing PNP transistor, using the same emitter and base, but with the 
substrate (and the isolation diffusion) as the collector. In normal operation 
a current about half the magnitude of the base current flows from the 
emitter to the substrate terminal. When the lateral PNP transistor saturates, 
the substrate current becomes almost equal to the collector current. If you 
don't have a buried layer, it gets quite a bit 
worse. 

One advantage of the lateral PNP 
transistor: the collector can be split into two 
(or more) sections. The emitter current, 
flowing radially outward is collected by the 
segments according to their length at the 
inside. There is a small loss in gain because 
of the gaps, but the matching between the two 
collector currents is excellent. 

In a CMOS process emitter and 
collector are usually formed by the p-type 
diffusion of a p-channel MOS transistor. The 
intervening space (the base-width) is the same 
as a p-channel gate, with poly-silicon on top. 
Connect the poly region to the PNP emitter; it Fig. 1-17: A split-collector 
will act as a static shield and have a (slight) lateral PNP transistor. 
beneficial effect. 


CMOS Transistors 


It took almost 20 years after the invention of the bipolar transistor 
for MOS to make its appearance. Shockley (and many others) had thought 
of this device first, it was (or should have been) much more simple: put a 
plate close to the surface of silicon, connect it to a voltage and move the 
carriers inside the silicon electro-statically. 

The problem was the surface of silicon. Here the silicon atoms are 
no longer neatly tied up with each other by sharing the outermost electrons. 
They face an entirely different material, SiO2 (or worse, some covering 
with unknown impurities mixed in). This material doesn't even have a 
crystal structure, it is amorphous. 
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In 1964 a startup, General Microelectronics, felt it had licked the 
problem with CMOS and brought out the first digital MOS integrated 
circuit. It was one of the worst products ever to hit the market: a large 
portion stopped working within days. The reason: there were elements with 
the silicon-dioxide (chiefly sodium) that carried an electric charge and could 
move. One day the MOS transistor was perfectly functional, the next day it 
was permanently turned on. 

It took another few years to gain an understanding of MOS surface 
physics and make stable MOS transistors. Today the silicon surface is so 
well understood that we can deliberately place a charge into the oxide layer 
that stays there for years, probably even centuries. It is now the dominant 
integrated device, being much smaller than the bipolar transistor. (The 
number of MOS transistors produced every year has long surpassed the 
number of ants in the world. At the time of writing this book, 
semiconductor manufacturers produced some 500 million transistors for 
every person in the world per year). 


The figure 
N-Channel P-Channel i 
seid eee shows a cross-section 
7” Twin prea” daa of the most often 


used (n-well) process. 
There are many 
variations and 
refinements; this is 
only the basic one. 

In the gate 
area the insulating 
layer (Si02 or nitride, 
or a combination) is 
thinned down and silicon is grown on top of it. Since the insulator is 
amorphous, the grown silicon is not single-crystal, it consists of many small 
crystals which do not fit together very well (thus it is called poly-crystalline 
silicon or simply poly). 

Next the source and drain regions are implanted, using a mask. The 
inside edges are masked by the gate, so they align perfectly to the gate (i.e. 
they are self-aligning). The device is also self-insulating: as long as the 
source and drain are at or above the substrate potential (usually ground), the 
junctions to the substrate are reverse-biased and no bulky isolation diffusion 
is necessary. 

For the p-channel transistor the polarities for the source and drain 
implants are reversed and these regions are placed inside an n-type 
diffusion. In most applications one such n-well hosts many p-channel 


Fig. 1-18: Cross-section of an N-well CMOS process. 
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transistors and is simply connected to the positive supply voltage; in this 
way the devices are insulated from each other as long as each source and 
drain is at or below the positive supply. 

In both the n-channel and p-channel transistors, sources and drains 
are identical, i.e. you can arbitrarily call one the source and the other the 
drain. Or one region can do double-duty, being the drain for one transistor 
and the source for the next one, connected in series. 

The p-channel transistor is always at a disadvantage, because holes 
are more difficult to move than electrons. Thus it will have a lower gain 
than an n-channel device (for the same gate oxide thickness) and be 
somewhat slower. (MOS 
transistors, by the way, 
are called unipolar 
devices, because they 
employ only one type of 
carrier, as opposed by the 
bipolar transistor, in 
which both electrons and 
holes are important for 
the operation). 

Now let's look at 
an (n-channel) MOS 
transistor in more detail. 
The basic idea is to create 
a region (a channel) 
between source and drain 
which has the same 
polarity (n-type), so that 


Poly (Gale) 
+2 Voms 


Poly (Gate) 
+2 Volts 


\ 
Depletion Region | 
j 


Fig. 1-19: As the drain voltage is increased, a 
depletion region pinches off the channel. 


there is direct conduction between 
the two. This is done with a positive 
voltage at the gate which pushes 
holes away from the surface and the 
device is called an enhancement- 
mode transistor (there are also 
depletion-mode devices in which a 
channel is implanted or diffused and 
then cut off with a negative gate 
voltage). 

This is true only at zero or 


Drain Current / pA 


0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 


Drain Voltage/V 200mV/div 


Fig. 1-20: Drain current vs. drain voltage 
with the gate voltage held constant. 
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voltage is increased, a depletion 
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region forms around it. Since there is now a voltage drop along the 
channel, with the drain side at a higher voltage than the source, the 
depletion region along the channel gradually increases toward the drain, 
cutting more and more into the channel. Thus the resistance of the channel 
increases. 

The initial slope of the 


drain voltage / drain current 20 
curve is the resistance of the % 
channel without any depletion _ 
layers. The final slope at the 7 


highest drain voltage represents 
its resistance with the depletion 
layer almost pinching off the 
channel. It is an unfortunate 
fact that this region is called 
the "saturation region", which 
clashes badly with the earlier 
definition for the bipolar 
transistor. Fig. 1-21: Drain current vs. gate voltage 
: with the drain voltage held constant. 

Above a certain gate 
potential, which has to be exceeded 
to attract any carriers to the surface (the threshold voltage) an MOS 
transistor is basically a square-law device: doubling the gate voltage results 
in four times the drain current. The measure of gain is the 
transconductance, drain current divided by gate voltage. So again, like the 
bipolar transistor, this is a non-linear device: 


Drain Current / pA 


0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 


Gate Voltage/V 200mV/div 


Ww 2 
Id = k—-(Vags - V,) 


where Id = drain current 
k = transconductance 
W =channel width 
L = channel length 
Vgs = gate-to-source voltage 
V7 = threshold voltage 
or Vgs - Vr = gate voltage above the threshold 


The region below the channel also influences the gain. It forms a 


back-gate. In an n-well n-channel transistor this is the substrate, common 
to all devices. You have no choice but to connect it to the lowest negative 
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voltage. But there is a choice for the p-channel transistor. If you place all 
the p-channel transistors in a common n-well, you get the smallest total area 
and therefore the lowest cost. But if the source of such a transistor is 
operated below the positive supply, the back-gate (the n-well) pinches off 
the channel further and you get a reduced gain (by perhaps 30%). You can 
avoid this by placing this transistor in its own n-well. 


The Substrate PNP Transistor 


In either a bipolar or CMOS process there exist layers which can 
form a PNP transistor with the substrate as the collector. Since the collector 
is permanently connected to the most negative supply voltage, such a device 
has limited use. In a bipolar process a lateral PNP transistor has greater 
flexibility and better performance and is thus almost always preferred. 

In a CMOS process the same is true, but because of historical 
reasons or limited information the substrate transistor is still present. The p- 
type implant for the p-channel transistor forms the emitter, the n-well the 
base and the substrate the collector. The n-well has a large depth, thus the 
PNP base-width is large and the gain rather small (e.g. 10). 


Diodes 


There are several p-n junctions in an integrated circuit, each and 
every one a diode. But few of them can actually be used by themselves 
without unpleasant side-effects. 

Take a simple bipolar process. There are three types of junctions: 

emitter/base, base/collector and collector/substrate 
rs) (all referring to the NPN transistor). The last one is 
hardly ever useful because the substrate is 
permanently connected to the most negative supply 
voltage. The base/collector diode is, as we have 
seen, part of a substrate PNP transistor with a gain; a 
N current perhaps ten times the magnitude of the diode 
Fig. 1-22: Properly current will flow to the substrate. 
planed The emitter/base junction makes a good 
diode, but it has a low breakdown voltage (about 6 
Volts) and the base has a fairly high resistance. You 
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could connect the surrounding collector to the most negative supply voltage 
and thus keep it always reverse-biased. But a much better diode results if 
you short collector and base together, creating a diode-connected 
transistor. The transistor is active, it has gain. Only a small fraction of the 
current flows through the base, which divides the base resistance by the 
current gain. This connection in fact gives you an almost ideal diode over 
about five decades of current. 

If the emitter/base breakdown voltage is too low, consider a diode- 
connected lateral PNP transistor. This devices has the full operating voltage 
of the process, but it is limited in current (see above). 

In a CMOS process the restrictions are even more severe. The only 
free-floating junction is between the p-channel source/drain and the n-well. 
But, as we have seen, these are also part of the substrate PNP transistor. 
Were you to run a current through this junction, a current of about ten times 
its magnitude would flow to the substrate. 

The term "diode-connected" is often used for an MOS transistor 
with its gate and drain connected together. Don't be misled by this term: 
there is no diode as in "junction" diode. 


Zener Diodes 


In a bipolar process the base-emitter diode almost always has a low 
breakdown voltage (perhaps 6 Volts) with a fairly low temperature 
coefficient, which makes it useful as a reference voltage. 

But exercise care with this device: the same junction is also used as 
a fusible device. 

At low current (e.g. less than 100uA for a minimum-geometry 
device) the Zener diode behaves well. As you increase the current the 
region between the emitter contact and the edge of the emitter diffusion 
lights up faintly (a plasma, which you can observe under a microscope, with 
all lights turned off). At some high current level a thin aluminum strip is 
formed abruptly underneath the oxide, which converts the Zener diode into 
a short-circuit. This effect is used for trimming and carries the earthy name 
Zener-zapping. 

Such a Zener diode is also somewhat noisy. For lower noise (and 
better accuracy) use a bandgap reference. 

Moving an n-channel and p-channel source/drain diffusion in a 
CMOS process close together so that they intersect can also result in a 
useful low breakdown voltage, but data for such a device are rarely 
available from the wafer-fab. 
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There are also buried Zener diodes, devices with a special 
diffusion below the surface of the wafer. Such devices have lower noise, 
but the addition to the process tends to be costly. 


Resistors 


Every free-floating layer in an integrated circuit can, when properly 
patterned, become a resistor. But for all of them this is only a secondary 
duty; their intended application is in a transistor, which is the hardest device 
to make. It shouldn't come as a surprise then that their values have a higher 
variation and greater temperature coefficient and their range is more 
restricted than that of even the least expensive discrete resistor. 

Discrete resistors can be tested and adjusted during manufacturing. 
In ICs the manufacturing is done while the silicon is red-hot, at which 
temperature it is no longer a semiconductor; you have to wait until it cools 
down to measure any parameter. 

What saves the integrated resistor is its natural ability to match well. 
Whatever error may have occurred in making one applies to any other on 
the same wafer. They may both be as much as 25% high in value, but both 
will be high by (almost) exactly the same amount. 

The resistance of any material is given by 


rho-l 
R = 
A 
where rho (p) = resistivity in Ohm-cm 


1 = length 

A = area (cross-section) 

If we make a square, i.e. w = 1, then we get a measure of resistance 
which is independent of size, the sheet resistance, in Ohms per square (or 
Ohms/D). 

Note the term is sheet resistance, not sheet resistivity. A square in a 
layer with a sheet resistance of 100 Ohms per square always measures 100 
Ohms from one side to the other no matter how large the square. 

In a bipolar process the layer most often used for resistors is the 
(NPN) base (about 200 Ohms/U)). The emitter layer is more heavily doped 
and thus has a lower sheet resistance (as low as 5 Ohms/L). 

In a CMOS process you have a wider choice: the n+ and p+ 
diffusions (implants) for the drains and sources, the n-well and usually two 
different poly layers. Of these the p+ diffusion (about 150 Ohms/D) and 
one of the poly layers (around 50 Ohms/L) are generally best suited. 

Sheet resistances depend greatly on the process; you should use the 
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values given here only as a starting point and get the actual data (including 
temperature coefficients and tolerances) from the wafer fab. 

Diffused resistors must be placed in an island of opposite doping 
and this island must be connected to a bias voltage so that the junction is 
reverse-biased. For example, a (p-type) base resistor must be in an n-type 
(epi) island. This island (sometimes called the "tub") can contain just one 
resistor or all of them, but its voltage must be at a level equal to or greater 
than the largest voltage on any resistor. In this case the easiest and safest 
connection is to +V. 

Diffused resistors (and to a lesser degree, poly resistors) have a 
voltage coefficient. The biased surrounding layer pushes a depletion region 
into the resistor, reducing its cross-section. As the difference in voltage 
between the resistor and the surrounding layer becomes larger, the depletion 
region widens, the cross-section becomes smaller and the resistance 
increases. This effect is especially pronounced in lightly doped layers: the 
n-well in CMOS and implanted resistors in a bipolar process. (The latter 
uses an additional implant to create a high sheet resistance). 

This voltage dependence is especially critical if you have two (or 
more) resistors which need to match but are at different DC levels. You can 
place each resistor in a separate island, biased at the positive end of its 
resistor. Or you can simply accept the change caused by the depletion layer 
and adjust the ratio. For this, however, you need a model for the resistor 
which includes its voltage dependence. (in a 200 Ohms/H base layer, for 
example, the change in resistance is about 1% for a 5V bias difference). 

There is also a (distributed) capacitance associated with an 
integrated resistor, low for poly, higher (and voltage dependent) for diffused 
ones. If you make a high-value (i.e. very long) resistor, this stray 
capacitance can seriously cut frequency response. Also, if there is noise on 
the supply which biases the surrounding region for diffused resistors, it will 
be capacitively coupled into the resistor. Again, a 
good model is required to show these effects in a 
simulation. 

Two correction factors have to be used when 
designing a resistor. The first concerns the width of 
the resistor. In diffused (or implanted) resistors there 
is always a sideways diffusion, which makes the 
actual resistor wider than drawn. The effect of the 
side-ways diffusion is dependent on the width of the 
resistor. 

The second correction factor recognizes the Fig. 1-23: 
end-effect. If the resistor has minimum width, you Resistor contacts. 
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will need to enlarge both ends to place a contact inside. You will then need 
to estimate the resistance of this additional area and of the contact itself 
(totaling perhaps 0.4 squares from the end of the narrow part). 

If you draw a wide resistor, the contacts can be fitted inside the 
resistor, but they will not cover the entire width, even if converted to one 
long contact. There is, therefore a small additional resistance (about 0.2 
squares from the inside edge of the contacts). 

The matching of resistors depends entirely on the width. Sub- 
micron processes are not developed to get good matching, just maximum 
speed. You will find that minimum-dimension devices (all devices, not just 
resistors) match very poorly. When greatly magnified under a microscope 
all edges appear somewhat ragged. The width of a resistor, for example, 
fluctuates considerably. It is only when you make a relatively large device 
that these fluctuations become insignificant and thus devices match well. 
Figure on using something like ten times the minimum width for matching 
of 0.5% or better. 

Because of the end-effect you cannot expect resistors of different 
lengths to match well. For optimum matching use only identical resistors. 
It also helps divide resistors into identical sections and intermingle them 
with other resistors (in the same identical sections) which are intended to 
match. 

One more thing about IC resistors: the Seebeck effect. Discovered 
in 1821 by Thomas Seebeck (and used by Ohm four years later for his 
measurements of resistance), it is the thermocouple effect: metallic 
interfaces at the ends of a wire produce a voltage if the ends are at different 
temperatures. For the contacts of a diffused or poly resistors this voltage is 
between 0.2mV/°C and 1.4mV/°C, depending on the doping level and 
composition of the metal. This is a danger if thermal gradients are present, 
e.g. with a power transistor on the chip. To avoid it, lay the resistor out so 
that beginning and end are close together. 

Pinch resistors (or pinched 
resistors) are sometimes used in bipolar 
processes to get a high resistance without 
wasting a lot of area. The base-pinch 
resistor is simply a base resistor with the 

eee Stes ot emitter diffusion placed over part of it. 
This reduces the effective cross-section 
(only the deepest part of the base diffusion 
is left, which has also the highest 
Fig. 1-24: Top view and cross- resistance). The device needs to be in its 
section of base-pinch resistor. own epi island, with the epi (and emitter 
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diffusion) connected to the positive terminal. A base-pinch resistor is non 
linear, has a low (e.g. 6-Volts) breakdown and a large variation (about 
10:1), but you can cram 100k Ohm of resistance into the space of a 
transistor. 
The second device is the epi- 
N+ (Emitter) satel pinch resistor. The cross-section of 
a long and narrow epi region is 
further reduced by moving the 
isolation diffusions on either side 
very close together. Since the epi 
region usually is of fairly high 
resistivity, a substantial depletion 
region extends into the remaining 
Fig. 10-25: Bulk or Epi-Pinch Resistor.  €Pi region, pinching it off at an 
operating voltage (above the 
substrate potential) in excess of about 5 Volts. Thus, at any voltage higher 
than that, the epi-pinch resistor becomes a current source. The variation of 
this current is high (8:1), but you can create a small current (a few micro- 
amperes) in relatively little space. 


Capacitors 


The oxide insulating the metal interconnection from the silicon (or 
between metal layers) is dimensioned to give minimum stray capacitance. 
Even a small capacitor (say 5pF) would take up an enormous amount of 
space. Enormous at least in microelectronic dimensions. 

Thus fabricators often provide an additional mask step to outline an 
area where the oxide (or nitride) is thinned down considerably, producing a 
higher capacitance (about 2fF/um? - that's femto-Farads, or 10°'°F/um’). 
With this figure (which of course varies from process to process) a 
50x50um area gives you all of 5pF, easily be the most expensive component 
in your chip. If you specify anything greater than 100pF, your colleagues 
may think you have a degree in macroeconomics. 

One plate of the capacitor is always either metal or poly. For the 
second plate you could use a diffusion, but that creates a slight voltage 
dependence (there is always a depletion layer in silicon which widens as the 
voltage increases, adding to the distance between the plates). Poly or metal 
for the second plate are better choices. 

The oxide underneath an MOS gate is already thinned down to 
achieve a reasonable transconductance, so it too has a higher capacitance 
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per unit area than the ordinary (field) oxide. But be careful here. At zero 
(DC) voltage there is no channel (source and drain form the lower plate, the 
gate the upper one), so the only capacitance is the one from the gate to the 
overlapping parts of drain and source. When the voltage exceeds the 
threshold, the channel comes into existence and the capacitance increases 
markedly. Figure 1-26 shown here depicts the behavior of a large 
(10x20um) 3V n-channel 
device. 
MOS Gab Capacithnce There is also junction 
P il a ll capacitance, which you should 
not dismiss lightly. The 
capacitance of a collector-base 
junction per unit area competes 
, quite well with that of an oxide 
capacitor, but is voltage 

a = 7s 15 2258 dependent (though not as much 

vey soomvidv as the MOS gate capacitance) 

Fig: 1-26: The gate capacitance of an MOS _— ald the stray capacitance for 
transistor is greatly dependent on voltage. one plate (collector to 
substrate) is higher. An even 

higher capacitance per unit area is offered by the base-emitter junction, 
though its breakdown is limited (about 6 Volts). The advantage of the 
junction capacitor is the elimination of the additional mask step. 


Capacitance / wuF 


Other Processes 


What we have considered so far are two simple, basic processes, 
requiring as few as 8 masks. There are many variations, all based on these 
two: 

- "Mixed Mode" CMOS, with devices for (somewhat) higher 
operating voltages and additional poly (and metal) layers; 

- BICMOS processes which add full-fledged bipolar transistors to 
CMOS; 

- Bipolar processes with vertical (high-speed) PNP transistors; 

- CMOS processes with some high-voltage devices (SOOV). 

All of these variations have one factor in common: they increase the 
number of masks (and processing steps) required and are thus more 
expensive. However, they tend to make the design of high-performance 
analog circuits easier, especially when both CMOS and bipolar transistors 
are available. 
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CMOS vs. Bipolar 


The debate as to which is better for analog design is as old as the 
devices themselves. Let's examine some of the main points: 

e The bipolar transistor requires an input (base) current, the CMOS 
devices does not. This is strictly true only at DC; at higher frequencies 
there is the input capacitance, which does result in a current. Also, some 
analog designs (see chapter 8) manage to bring this current down to a very 
low level. 

e Bipolar transistors have lower offset voltages. Generally true, but 
offset voltage depends on size. Make a CMOS transistor larger than a 
bipolar one (or use trimming) to achieve equally low offset voltage. 

e Bipolar transistors have lower noise. Again generally true, 
especially at low frequency (1/f noise, see chapter 6). One exception: the 
auto-zero (or chopper stabilized input, see chapter 8). 

e CMOS devices have smaller dimensions. Generally not true. To 
get the required performance in an analog design (matching, gain, low 
noise), CMOS transistors need to be much larger than the minimum 
dimensions of the process would allow. At reasonably high supply voltages 
(3 Volts and above) CMOS and bipolar devices end up about equal in size. 

e Bipolar transistors are better for low-voltage design. True. 
Transconductance in a CMOS device increases as the square of the gate 
voltage above the threshold. If the gate voltage can only go, say, 0.5 Volts 
above the threshold, it takes a painfully large gate-width to get a substantial 
drain current. In the bipolar transistor a ten-fold increase in collector 
current is obtained with only a 60mV (at room temperature) increase in base 
voltage. It is ironic that CMOS is marching toward lower and lower 
voltages, where it is at a serious disadvantage. 


Edition February 2005 1-34 All rights reserved 


Camenzind: Designing Analog Chips Chapter 2: Simulation 


2 Simulation 


In 1972 the Electrical Engineering Department of the University of 
California at Berkeley released the first version of SPICE (Simulation 
Program with Integrated Circuit Emphasis). Donald Pederson, the head of 
the department decided to do this free of charge; after many additions, 
revisions and improvements (done by a “cast of thousands” of graduate 
students) it is still free today. 

The Berkeley SPICE program (originally written in Fortran) has 
been modified and sold by dozens of companies under various names. 
Some of the modifications were useful (such as the adaptation to PC use), 
many others merely served to make these programs incompatible with each 
other. 

So, be aware that there are differences in capabilities and notation 
between Spice programs. Also, it is no longer true that such analysis 
programs running on more expensive workstations under Unix are better or 
faster; some PC programs (notably Simetrix) have outdistanced their Unix 
cousins in both speed and added features. 

Simulation for analog ICs differs greatly from any kind of digital 
simulation. The most important factor in the latter is speed. This has led to 
an ever finer representation of internal capacitances and other stray-effects 
in the models used. In an analog IC, speed is just one of many 
requirements. We rely heavily on matching, and need to know the effect of 
the variations of many parameters in an almost unlimited number of 
combinations with great certainty. Each device also needs to be represented 
accurately over the entire operating range, not just in two states. The 
models, therefore, become the most important factor. 

Unfortunately, the quality of models for analog or mixed-mode ICs 
varies greatly. Some - few - are very accurate and from simulations alone 
you can tell with great certainty how well your design will work in silicon, 
down to the exact distribution of each circuit parameter in production. But 
most models issued by foundries are not in this category, lacking 
information crucial to analog design. 

In the second half of this chapter there is a fairly detailed discussion 
of device models for Spice. This is a somewhat tedious task, but necessary 
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to judge the quality of the models available to you. Read this part lightly 
and then use it for later reference. 


What Can You Simulate? 


A good analog simulator can tell you all you need to know about a 
design. But be aware that simulators which fall into the average category - 
including the most popular one - lack several of the most desirable features. 

In Spice there are three basic simulations: 


DC Analysis 


Let's use a simple example, a 
buffer, a very simple circuit with a 
voltage gain of one. Two NPN 
transistors (Q1, Q2) form a differential 
stage, Q3 is a current mirror and Q4 an 
emitter follower (more about this in 
the next two chapters). For the current 
mirror we use a lateral PNP transistor 
with a split collector (see chapter 1). 

In the first DC analysis we 
continuously change (sweep) Vin from 
zero to 5 Volts and observe the 
output. The simulator tells us that 
the output follows the input, but 
only above about 0.6 Volts and 
below 4.1 Volts (the common- 
mode range). 

You could enhance this 
analysis by repeating it at various 
temperatures, i.e. by automatically 
"stepping" the temperature, either 
at regular intervals or at three or 
four points. While you do all this 
Fig. 2-2: A DC analysis, showing the ae cay mean the mpurerens 

common-mode range. (either at the base of Q1 or at 
either terminal of Vin), the current 


Fig. 2-1: A simple example for 
simulation, a bipolar buffer. 


Output Voltage / V 


0 1 2 3 4 


Input Voltage/V 1V/div 
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consumption (at one of the terminals of Vcc), the substrate current (out of 
the symbol SUB) and even the power dissipation of the entire circuit or any 
component. 

Place a current source from Out to ground and you can determine 
how well the circuit handles a load, i.e. determine the output impedance. 

There are two sub-categories ina DC Analysis. The Transfer 
Function gives you the relationship between two nodes (not used very 
often) and the Sensitivity Analysis tells you which parameters (including 
transistor parameters) are most responsible for a change in a particular 
voltage or current at any node. 


AC Analysis 


The one thing you never want to forget about a Spice AC analysis is 
this: The signal is treated as if it were insignificantly small. You may 
specify a 1-Volt input signal (most people do, it represents zero dB and is 
thus very convenient), but the analysis program will process it without 
disturbing any of the bias levels. If you have a high gain, say 60dB, the 
output plot will show a voltage of 1000 Volts without even blushing. What 
it is intended to show you is the gain relative to the input; the actual values 
taken out of context are often absurd. 

Our plot shows the 
output response of our buffer in 
the most simple AC analysis: a 
1-Volt ac signal at the input on 
top of a DC Voltage of 2V, with 
the frequency swept from 
200kHz to 200MHz. The output 
is in dB relative to the input, so 
OdB is a "gain" of 1, -3dB isa 
"gain" of 0.708 (or a loss of 


200k400k 1M 2M 4M _~ 10M 20M 40M 100M200M 29.2%). 


Gain / dB 


pieaueneniuiene We could also move the 
Fig. 2-3: AC analysis: gain (in dB) vs. AC source into the Vcc supply 
frequency. (make sure there is only one AC 


source per circuit). If we do this 
we can measure how much of a supply's ripple gets into the output, i.e. 
power supply rejection. 
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With equal ease you can measure the AC response of an output 
current relative to an input current. But when it comes to the relationship 
between a voltage and a current a measurement in dB makes little sense. 

Spice also lets you measure the phase of any voltage or current 
(relative to the phase of an input signal). This is of particular interest in 
circuits which employ feedback, but more of this in chapters 6 and 8. 

Remember though, this is a small-signal analysis, done at one 
particular bias point. The AC response (particularly the phase) may be 
different as a real-life signal moves operating voltages and currents. 

An adjunct to AC 
Analysis is the Noise Analysis. 
Here the AC source is turned off 
and the combined effect of all 
noise sources inside the circuit 
(resistors, currents) at the output 
is displayed. The measure is 
nanovolts (or microvolts) per 
root-Hertz. Despite the awkward ie 
name, it is in fact an elegant | 
measure. To get the actual noise i0 700 ik Tok 100K 7M 
(usually in uVrms) you simply Frequency / Hertz 
multiply the value taken from the 
curve by the square-root of the 
frequency interval of interest. For 
example, between 100Hz and 1kHz we read an average of about 12nV/rtHz. 
Multiply this by the square root of 900 and you get 360nVrms of noise, if 
you look at it with filter which cuts out everything below 100Hz and above 
1kHz. Similarly, in the flat (white noise) region between 10kHz and 1MHz 
we would measure about 24uVrms; even though the curve has a lower 
value, the total noise is much larger because of the wider frequency range. 


100 


Output Noise / V/rtHz 
3 


Fig. 2-4: Output noise vs. frequency. 


Transient Analysis 


Here we convert Vin to a pulse source (instead of DC or AC) and 
look at the output not over a voltage or frequency range, but time. You may 
have to make a few trial runs to get the appropriate pulse-width and total 
analysis time. At first the program will choose its own time steps, 
shortening the intervals when a lot of changes happen and lengthening them 
when no changes are taking place. But, if you are not satisfied with the 
resolution, you can dictate what maximum (or minimum) time-step it is 
allowed to take. 
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Output Voltage / V 
Output Voltage / V 
bn PW NY PW 


0 20 40 60 80 100 120 140 160 180 200 0 02 #04 O06 08 1 120614 #16 18 2 


Time/mS 200pSecs/di 
Time/nSecs 20nSecs/div ae poeeeray 


Fig. 2-5: Transient analysis 1: a 1-Volt Fig. 2-6: Transient analysis 2: a 1Vpp 
pulse at the input. sine-wave at the input. 


Change the input to a sine-wave and you will learn a great deal more 
about the circuit. It will be immediately obvious if the circuit can reproduce 
the waveform without clipping it at either the high or low excursions. But 
that is only a rough impression of fidelity. What you need to know is the 
amount of distortion in the waveform. 

In some programs you simply display the sine-wave, click on 
distortion and get the result. But if you want to have the entire information, 
nothing beats a Fourier Analysis. 

The Fast Fourier Transform (FFT) is a routine which extracts the 
frequency components from a waveform. It is rather tricky to use and 
sometimes produces errors. Shown here is the result of a continuous 
Fourier analysis (Simetrix), 
which is both more detailed ' 
and more reliable. 

What we see in the 7 
graph is amplitude versus 
frequency. At 1kHz there is 
the fundamental frequency with 
an amplitude of nearly (but not 
quite) 1 Volt. At 2kHz is the 
second harmonic with an 
amplitude of 100uV, or 0.01%. 
The third harmonic measures 
about 12uV, or 0.0012%. Frequency/kHertz 500Hertz/div 

To get this kind of resolution you need to run the sine-wave for 
many cycles (at least 1000) with Fig. 2-7: Fourier analysis. 
small enough time steps. 


Spectrum(Out) / V 


0.5 1 1.5 2 2.5 3 
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Noise analysis in the small-signal (AC) mode has strict limitations. 
It presumes that the operating voltages and currents are steady. This is fine 
for a circuit which is perfectly linear, but it falls down if a design is non- 
linear, either by design or by mishap. Take, for example, the case of a 
mixer (or modulator). A signal of a particular frequency enters a 
deliberately non-linear block, such as a diode mixer or the phase-detector of 
a phase-locked loop. The non-linearity creates other frequencies (usually 
much lower ones, such as frequency differences), one of which we use and 
amplify. An AC noise analysis is useless here, because it cannot follow 
what happens to the noise as it is transformed by the mixer. 

What we need in such a case is a transient analysis program which 
pays attention to noise sources. Few have this capability; a notable 
exception (again) is Simetrix. 


The Big Question of Variations 


As pointed out in chapter 1, device parameters in an IC vary from 
run to run and from wafer to wafer. The devices are made at temperatures 
at which the material is no longer a semi-conductor. You have to wait for 
the wafer to cool down to measure the parameters of a diffusion. 


o (68.3%) 
20 95.4%) 
3a (99.73%) 


4o (99. 994%) 
- al 


Fig. 2-8: The normal or Gaussian distribution. 
Most parameters follow a "normal" (Gaussian) distribution. There is 


a mean value, at which the number of occurrences is maximum. A 
deviation of +s (sigma) from this point contains 68.3% of all measured 
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values. If you allow the deviation to be three times as large (+3s ) you 
enclose 99.73% of all measurements. 

This sounds like you are discarding only 0.27% of all values, but the 
figure is deceiving. So far we have considered one parameter only, but 
there are many in an integrated circuit. Suppose your design is influenced 
by 50 of them. The total parameter "yield" then is .9973°°, or 87.4%. In 
other words, you would discard 12.6% of all chips on a wafer. 

Unless you simply don't care about cost, you need to design an 
analog IC so no chips are lost because of parameter variations, i.e. the 
design can withstand a variation of each and every device parameter to at 
least 30; 46 would be better. 

But how do you find out how much parameter variation your design 
can take? The answer is Monte Carlo analysis, and only Monte Carlo 
analysis. 

There is in use what is called a "four-corner analysis". Device 
parameters are bundled together in four groups, representing extremes, or 
worst cases. The plain fact is this: it doesn't work for analog circuits. The 
four-corner models are just barely able to predict the fastest or slowest 
speed of digital ICs, but the grouping simply doesn't apply to analog ones. 
In fact, no grouping is possible; a parameter's influence differs from design 
to design. Analog designers who are satisfied with a four-corner analysis 
simply fool themselves into believing that they have a handle on variations, 
when in truth the result is quite meaningless. 

A true Monte Carlo analysis varies the device parameters in a 
random fashion, so that every combination of variations is covered. This is 
also what you get in production. 

You don't need to vary every parameter of a device, only the major 
ones. For example, varying IS, BF and the capacitances in a bipolar 
transistor model is sufficient; the same is true for the threshold voltage, the 
transconductance and the capacitances in MOS transistors. If matching is 
expected, there must be two additional entries, one for the absolute 
variation, and one for the variation between devices on the same chip. 
These "tolerances" are either inserted into the model file directly or 
contained in a separate file, depending on the analysis program used. The 
Monte Carlo program then simply runs the chosen analysis repeatedly, each 
time with a different set of variations, randomly chosen. Our example 
shows the variation over temperature for a bandgap reference (untrimmed). 

How many runs need to be specified for a Monte Carlo analysis? 
There is an easy way to find out. Start with 20. Then increase this number 
until the extremes no longer change. For this analysis 50 runs were used, 
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which is more than needed. But with today's fast computers you can afford 
to go overboard: the analysis took all of 8 seconds! 

This one picture gives 
you the variation of a 
reference voltage over 
temperature, exactly as it will 
happen in production. You 
notice a Gaussian distribution 
(more curves at the center, 
fewer at the extremes). 
Between the top and bottom 
curves lie 99.73% (3-sigma) 
of all circuits. 

Without the Monte 
Carlo analysis we would not 
know how much variation to 


Vref / V 


Temperature/Centigrade 20Centigrade/div 


expect until several wafers Fig. 2-9: The result of a Monte Carlo analysis. 
f 1 diff t lot Each curve represents the behavior of one circuit 
rom several dilterent 1ots (among 50) in production. 


have been tested (a single 
prototype wafer cannot tell you what the variations in production will be). 


Models 


The Diode Model 


As we have seen in chapter 1, there isn’t any one junction in an IC 
which can be used directly as a diode; a “diode-connected” transistor does 
this job with greater accuracy and far fewer side-effects. 

However, a bipolar transistor consists of junctions, 
at least two of them. Thus a model for a junction diode isa J 
fundamental element in models, even in CMOS. In the RS 


model file (which is always in ASCH) you might see the 
following: ig , eis 


-MODEL Diodel D IS=1E-17 RS=20 CJO=0.85E-12 


: "LD; " Fig. 2-10: a simple 
A model statement always starts with a dot. "Diodel dice equivalent 


is the name of the device (which can be anything) and circuit. 
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D says the device is a diode. The remaining entries are in Amperes, Ohms 
and Farads. 

This is about as simple a diode model as you can possibly make it; 
just three parameters are specified. IS, together with the series resistance 
RS, determine the DC characteristics, CJO the junction capacitance. Let's 
first look at the DC behavior. 

As we have seen in chapter 1, the current/voltage relationship of an 
ideal junction is given by: 

Vdq 
I=Ise kT 
where Is = the diffusion current 
Vd = the voltage across the current source (i.e. not including RS) 
q = the electron charge 


k = the Boltzmann constant 
T = the temperature in Kelvin 


In Spice this diode equation is greatly expanded: 


Vd-q EG-q 


Rie © (AT-1):-—~ Vd-q V 4 a 
D=18)e"**) ke NET. ATI+ISR eee) ik a +0.005 ” 


At first this equation might appear utterly complicated, but it isn’t. 
If you look at the first portion (to the left of the + sign), you see three 
multiplied constants: 

N = the Forward Emission Coefficient (1) 

EG = the energy gap, which depends on the material (1.11 for 

silicon); you set this to 0.69 for a Schottky diode, 0.67 for 

germanium and 1.43 for Gallium-arsenide. 

XTI = the temperature coefficient of IS (3). 

AT = the quotient of operating (1.e. junction) temperature to room 

temperature (usually 300K or about 27°C). 

With these three constant you can shape the basic exponential curve 
to what is actually measured. If you don't list them in the model statement, 
they assume the values shown in parentheses. 

The portion of the equation to the right of the + sign adds a leakage 
current, i.e. a small current in excess of the (reverse) current predicted by 
the ideal diode equation. You can modify the shape of its curve with the 
constants NR, M and VJ. 

As shown, the model makes the breakdown voltage of the diode 
infinite. You can limit this with the parameter BV and three companions: 

TBV1 (its first-order temperature coefficient) 
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TBV2 (second-order temperature coefficient) 
IBV (the current at which breakdown is specified) 

There is also a parameter, IKF, which splits the DC curve into two 
regions. In ICs this is very rarely used. 

The series resistance, RS, which also influences the DC behavior, 
has first and second-order temperature coefficients, TRS1 and TRS2. 

The junction capacitance shown in the model, CJO (or CJO) is 
measured at zero voltage. Since its value at different voltages (forward or 
reverse) depends on the grading of the junction (abrupt, diffused, implanted 
etc.), it too is modified by three constants, VJ (1), M (0.5) and FC (0.5). If 
you don't list them in the model statement, the default values in parentheses 
will be used. 

For a voltage across the diode (not including RS) equal to or less 
than the product of FC and VJ, the formula is: 


-M 
C=CJO.(1-¥3) 

If the voltage across the diode is greater then FCxVJ: 

C= CJO.(1- Fc) "™) .(1- FC. (14+ M)+ M4) 

There are two noise sources in a diode: the resistor RS and the 
current I1. Without any additional parameters these are treated as white 
noise sources, i.e. the noise is the same at any frequency. (For a more 
detailed look at noise see chapter 6). Since there is also flicker noise (which 
increases at low frequency), two constants, KF and AF are used and the 
following expression is added to the current source noise: 

KF .n*" 
f 


The Bipolar Transistor Model 


42 parameters are used to represent a bipolar transistor in Spice. 
While the number may look a bit daunting, it is actually quite straight- 
forward once you are familiar with the Spice diode, and they are placed into 
five groups: 


Base-Emitter Diode. Here we have the plain diode Spice parameters as 
discussed above, but re-named; the abbreviations in parentheses refer to the 
ordinary diode parameters: IS (IS), NF (N), ISE (SR), NE (NR), RE (RS), 
EG, XTI, CJE (CJO), VJE (VJ), MJE (M), FC. The series resistance of the 
diode is divided into two parts: RE (at the emitter end, with the emitter 
current flowing through it) and RB (at the base-contact end). The latter 


Edition February 2005 2-10 All rights reserved 


Camenzind: Designing Analog Chips Chapter 2: Simulation 


starts at RB at low current and drops gradually to the value RBM at a 
current specified by IRB; this reflects the use of the entire emitter at low 
current and only the emitter edge facing the base contact at high current, as 
discussed in chapter 1. 


Current Gain: The main parameter here is BF (the forward beta, or hFE) 
and its temperature coefficient XTB. Without any additional parameter the 
current gain would be the same at any collector voltage or current. The 
Early effect is represented by VAF (the Early voltage, see chapter 1). The 
drop-off at high current is produced by IKF (the current at which hFE starts 
to drop) and NK (the steepness of the drop). ISE and NE of the base- 
emitter diode are responsible for the drop in hFE at the low-current end; 
simply shunting a small amount of base current to the emitter. 


Reverse Current Gain: You may be convinced that you will never operate 
a transistor with the collector and emitter interchanged, but just in case 
provided the parameters BR, NR, VAR, IKR and TR . 


Base-Collector Diode: Here again we have the basic diode Spice 
parameters, again renamed: ISC (IS), NC (N), RC (RS), XTF (XTD, CJC 
(CJO), VIC (VJ), MJC (M) and TF (TT). The last one is the transit time 
(now through the base to the collector) which accounts for any delay which 
cannot represented by capacitance alone; it is embellished by ITF (which 
makes TF dependent on current), VTF (showing dependence of TF on base- 
collector voltage) and PTF (an excess phase at a frequency 1/(TFx27)). 


Noise: As in a simple diode, additional low-frequency (flicker) noise is 
represented by the parameters KF and AF, but here they work on the 
collector current. 


The Spice model for an integrated bipolar transistor can have either 
three or four terminals. The fourth terminal (of an NPN transistor) is the 
substrate and between it and the collector there is a diode, represented by 
the five parameters ISS, NS, CJS, VJS and MJS. This is a major flaw in 
Spice, for a mere diode here is inadequate. When the transistor saturates, a 
substantial portion of the total current flows to the substrate, which this 
model simply ignores. 

Fortunately Spice also contains a solution to this problem. To 
represent an NPN transistor correctly, you need to add a second transistor; 
Spice lets you combine the two (or any number of devices) in a subcircuit. 
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When N1 saturates (i.e. the collector drops below the base 
potential), the PNP transistor P1 becomes active and draws base current to 
the substrate. This is what happens in real life: there is a stray PNP 
transistor, formed by the base (P), the collector (1.e. the epitaxial layer, N) 
and the substrate (P). 

Rather than put the stray 
capacitance (and leakage current) between 
collector and substrate into the PNP 
transistor (which is somewhat 
cumbersome) a separate diode DCS is 
inserted. DZ, a Zener diode, corrects 
another flaw of Spice: there are no 
breakdown voltages in the bipolar 
transistor model; for the base-emitter Fig. 2-11: Equivalent circuit for 
diode we need this effect, it is sometimes an integrated NPN transistor. 
used as a Zener diode. If you also want 
to have a collector-emitter breakdown, place an additional Zener diode 
between collector (cathode) and base. 

The model for this subcircuit looks as follows: 

SSUBCKT NPN1 1234 
Q1123NI1 

Q2412P1 

D123 DZ 

D2 41DCS 

ENDS 

The first line, after the SUBCKT (all models start with a dot) lists 
the name of the subcircuit and the order of connections (which will be 
followed in the netlist). The next four lines list the device types, the 
connections and the name of the model. The last line signified the end of 
the subcircuit listing. 

Spice lets you define global nodes. This is especially convenient 
for the substrate and avoids cluttering up the schematic with unnecessary 
lines. This feature is used throughout this book for bipolar devices. 
However, you will have to remember to place the contact to the substrate 
(SUB) at the appropriate point (almost always the most negative supply). 

Spice now needs a model for each of the devices used. For example 
(for a 20-Volt process): 

MODEL N1 NPN IS=3.8E-16 BF=220 BR=0.7 

+ ISE=1.8E-16 IKF=2.5E-2 NK=0.75 IKR=3E-2 NE=1.4 VAF=60 

+ VAR=7 RC=63.4 RB=300 RE=19.7 XTB=1.17 XTI=5.4 

+ TF=1.5E-10 TR=6E-9 XTF=0.3 VTF=6 ITF=5E-5 CJE=0.21E-12 


1 (Collector) 


3 (Emitter) 4 (Substrate) 
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+ MJE=0.33 VJE=0.7 ISC=5E-12 KF=2E-13 AF=1.4 
-MODEL P1 PNP IS=1E-15 BF=100 CJE=0.175E-12 XTI=5.4 


+ MJE=0.38 VJE=0.6 


-MODEL DZ D IS=1E-18 RS=250 BV=5.9 IBV=10UA 


+ TBV 1=1.8E-4 


-MODEL DCS D IS=1E-17 RS=10 ISR=5E-12 CJO=0.85E-12 


+ M=0.42 VJ=0.6 


Note that the model for DZ has no capacitance; this is already 


present in the base-emitter diode of N1. 


The Model for the Lateral PNP Transistor 


For a lateral PNP transistor the Spice bipolar transistor model alone 


is woefully inadequate. This type of transistor 
not only produces a substrate current when it 
saturates but also in its normal operation; 
neither of these is present in the Spice model. 


To correct this flaw, we need to use a 2 (Base) 


subcircuit again, only this time two additional 
transistors are required, one to cause the 
substrate current at saturation (Q21) and one at 
normal operation (Q31); the parameters of the 
latter (particularly IS and BF) are chosen so that 
the substrate current is smaller than that of QI1 
(generally about 20%). 
The model for this subcircuit looks like this: 

SUBCKT PNPI1 123 4 

QP11 123 QP1 

QP21 42 1 QP2 

QP31 4 2 3 QP3 

ENDS 


3 (Emitter) 
Q11 Q31 
: K 7 ¥ 
: Q21 
1 (Collector) 4 (Substrate) 


Fig. 2-12: Equivalent circuit 
for a lateral PNP transistor. 


And the models, again for an arbitrary example of a 20-Volt 


process: 
-MODEL QP! PNP IS=1E-16 BF=89 VAF=35 


+ IKF=1.2E-4 NK=0.58 ISE=3.4E-15 NE=1.6 BR=5 
+ RE=100 RC=800 KF=1E-12 AF=1.2 XTI=5 ISC=1E-12 
+ CJE=0.033E-12 MJE=0.31 VJE=0.75 CJC=0.175E-12 


+ MJC=0.38 VJC=0.6 TF=5E-8 TR=5E-8 
+ XTF=.35 ITF=1.1E-4 VTF=4 XTB=2.3E-1 


-MODEL QP2 PNP IS=5E-15 BF=150 RE=100 TF=5E-8 XTI=5 
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MODEL QP3 PNP IS=1E-18 BF=25 CJC=0.85E-12 
+ MJC=0.42 VJC=0.6 XTI=5 RE=100 


MOS Transistor Models 


Once upon a time there was a company which brought out its own 
variation of the Berkeley Spice program: HSPICE. It specialized in making 
refined models for MOS transistors, many of them. The models were called 
levels and many companies bought their own levels, like boxes at the opera. 
AMD had three of them, Siemens acquired two. Motorola, National 
Semiconductor, Sharp, Cypress, Siliconix and a few others only got one. 
By 1995 there were 39 such levels and us poor ordinary folks couldn't 
access most of them: they could only be used by the company who had 
sponsored them. 

At last good old Berkeley came to the rescue. A team of researchers 
developed the BSIM model (Berkeley Short-channel IGFET Model). The 
team stayed with it, through BSIM1, BSIM2, BSIM3 and even BSIM4. 
These models divided the MOS transistors in ever finer structures, tracking 
the trend toward geometries far below lum. As of this writing BSIM3.3 is 
the dominant model in the industry, leaving the many HSPICE levels in the 
dust. 

Naturally HSPICE took the BSIM models and made its own version, 
adding more levels. 

The increasing BSIM refinements have its toll: the number of 
parameters has become very large, so large that it takes an entire book to 
explain them. For digital ICs, which require utmost speed, this simply has 
to be accepted. For analog designs, which invariably use larger dimensions 
to obtain adequate performance (especially for matching), it is a burden 
only grudgingly tolerated. MOS model-making has become an art 
dominated by the digital realm, of limited use to the analog designer. 

In a modern BSIM model you are confronted by a mass of data 
which almost always is presented in an arbitrary way, lacking an 
organization which would make it more understandable. To help in a minor 
way, they are grouped here; the bold-faced parameters are absolute values; 
all others are modifiers. Parameters in square brackets are temperature 
coefficients. 
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Threshold Voltage: VTHO, K1, [KT1, KT1L], K2, [KT2], K3, K3B, 
DVTO, DVTOW, DVT1, DVTIW, DVT2, DVT2W, VBM, VOFF, KETA, 
PSCBE1, PSCBE2. 


Mobility: UO, UA, [UA1], UB, [UB1], UC, [UC1]. 


Saturation: VSAT, [AT], AO, AGS, Al, A2, BO, B1, DELTA, EM, PCLM, 
PDIBLC1, PDIBLC2, PDIBLCB, DROUT, PVAG, AGS, ALPHAO, 
BETAO. 


Sub-Threshold: ETAO, ETAB, NFACTOR, DSUB. 


Geometry: W0, DWB, DWG, LL, LLN, LW, LWL, LWN, WL, WLN, 
WW, WWL, WWN. 


Capacitances: CGS0, CGD0, CGBO, CJ, MJ, MJSW, PBSW, CJSW, 
MISW, CJSWG, MJSWG, PBSWG, PB, CGSL, CGDL, CKAPPA, CF, 
CLC, CLE, DLC, DWC, ELM, CDSC, CDSB, CDSCD, CIT, 


Resistances: RSH, RDSW, [PRT], PRWB, PRWG, WR, LINT, WINT 


Process Parameters: TOX, XJ, XT, NCH (PCH), NGATE, NLX, NSUB, 
GAMMA1, GAMMA2Z, JS, [XTI], NJ, JSSW. 


Noise: AF, KF, EF, EM, NOIA, NOIB, NOIC 


BSIM models also allow "binning": several models are written for 
different geometries of the same device, and then selected to fit into a range 
of gate width and length with the parameters LMIN, LMAX, WMIN and 
WMAX. While this is not really necessary for the parameters listed above 
(some foundries, notably AMS, manage to create equally accurate model 
without binning), the Monte Carlo variations should be tied to channel 
width and length (i.e. area). Note that the multiplier M is used for 
transistors with a channel width beyond WMAX. 


To get into more detail on the many parameters, you will need to 


consult the original Berkeley documentation (see references). Be 
forewarned: this is a lengthy document. 
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Resistor Models 


A Spice resistor model has no stray capacitance, nor does it 
recognize any possible effect by surrounding layers. There are some cases 
where such a simple model is inadequate. For example, the frequency (and 
phase) response of large-value resistors (SOkQ and more) can be significant 
enough to bring about oscillation in a feedback path. Also, an error is 
introduced in a divider, if the resistors are diffused and placed in the same 
pocket (or "tub"); each resistor is at a different DC potential and their 
voltage dependence will result in slightly different values. This error 
becomes large with ion-implanted resistors. 

Some simulation programs have the capability to extract stray 
capacitances from the layout, but few pay heed to voltage dependence. If 
you want the complete behavior before the layout is done, here is a model: 


SUBCKT RCV 1 2 
R114 RB {m/3} i K Sa 
R245 RB {m/3} 

R356 RB {m/3} ' 4 i 
V1620 Nae ae le 
B1 61 I=I(V1)*(0.0033*((V(3)-(V(1)+V(2))/2))*0.6) ' 5 

D1 1 3 DRSUB {m/2} D3 > RS 

D2 43 DRSUB {m} : 

D3 53 DRSUB {m} pit 

D4 6 3 DRSUB {m/2} er 
ENDS 

MODEL DRSUB D IS=1E-16 RS=50 , 


+ CJO=2.7E-14 M=0.38 VJ=0.6 
Fig. 2-13: Equivalent 

ei : se . . circuit for a 3- 
This is again a subcircuit. The resistor is segment integrated 


divided into three equal sections and the stray resistor. 
capacitance is represented by four diodes to the 
surrounding n-type material (assuming that the resistor is p-type). 

To model the voltage dependence, the current is measured in the 
dummy voltage source V1 (with zero voltage) and from it a current I1 is 
created and subtracted from the total current through the resistor. The value 
of this current is: 


( reveven)y” 
N= FV1)-| 0.0033-| ¥(3)= 


where 0.0033 and 0.6 determine the amount and shape of voltage 
dependence. Note that the bias voltage is applied from terminal 3 to the 
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mid-point of the resistor. B 
(in Simetrix) is an arbitrary 
function, serving as a 
current-controlled current 
source. 


160 Sections 


Contrary to 2 
common belief, a three 
section lumped model is 


dB 


remarkably accurate. 4 3 Sections 
Compare the frequency 

response of such a model 3 

with one that has 160 

sections. i 100k 200k 400k 1M 2M 4M 10M 20M 40M 


Frequency / Hertz 


Fig. 2-14: Comparison of lumped resistor 


Models for Capacitors models. 


There are only two cases where a simple capacitor model (i.e. an 
ideal capacitor) is inadequate: 

1. There is a requirement for unusual precision. If one plate of an 
oxide capacitor is a diffused layer (or a poly layer with a high sheet 
resistance) the capacitance will decrease slightly as the potential across the 
plates is increased. A competent model will reflect this non-linearity. 

2. The capacitor is used at the high-frequency end. Here it is not of 
great importance for the model to show the non-linearity, but to reflect any 
series resistance and stray capacitances from both the lower and the upper 
plate to neighboring regions. 


Pads and Pins 


If you are working at high frequencies - say above 50MHz - you 
need to consider the properties of the pads, the ESD protection devices, the 
bonding wires and the package pins. A pad has a capacitance to the 
underlying layer (usually ground); with an ESD protection device this can 
easily amount to more than 1pF. The bonding wire has an inductance; it 
may be small (perhaps 7nH, but this depends on the length of the wire), but 
it begins to play a role above about 1OOMHz. Then there is the package pin 
capacitance, which is not to ground but between pins (about 1 pF, but 
greatly dependent of the package). 
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Just How Accurate is a Model? 


The quality of device models from wafer-fabs varies greatly. A few 
are outstanding, unerringly accurate and complete. Others are so bad that 
they will almost guarantee major flaws in your design. The majority of 
them are incomplete for analog design. 

It pays to examine the models before starting to simulate. If the 
NPN transistor model is not a subcircuit, use it with caution; behavior in 
saturation is going to be different in the real circuit. If the lateral PNP 
model is not a subcircuit, it doesn't make much sense to use it at all. 

A set of device models is not really ready for use until it has been 
tested in actual circuits. Unfortunately models are commonly put together 
by people who are not designers (especially not analog designers), so they 
tend not to be verified in real-world applications. 

This is especially true for bandgap references (see chapter 7), which 
demand uncommon accuracy from the bipolar transistor models. Even a 
small error in VBE (i.e. the basic diode voltage) and its temperature 
coefficient causes intolerable errors in the reference voltage. Here it is in 
fact preferable to set such parameters as IS (and its modifiers) so that it fits 
(several) designs existing in silicon. 

You should also check the models for the presence of Monte Carlo 
parameters. If there aren't any, you are going to be seriously handicapped 
for an analog design. 
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3 Current Mirrors 


Bob Widlar was a truly great designer of analog ICs. He was wild 
and totally unmanageable and had an odd sense of humor. The press loved 
him and he had a flair for self-promotion. He shunned computer analysis, 
preferring to breadboard his circuits, but time and time again he came up 
with nuggets of design details and products which were thought to be 
impossible. Burned out by the frenzy of Silicon Valley he moved to 
Mexico, where he died in 1991 at age 53. 


One of Widlar's early contribution is the 
current mirror, a design detail (or design element) 
which you will now find in just about any analog IC. 

Start with the primary current, I1, which flows 
into the diode-connected transistor Q1. This produces 
a voltage drop across Q1, namely that of its base- 
emitter diode; this voltage drop is called a VBE. 

Now connect the base and emitter of a second, 
identical transistor, Q2, to the same nodes as those of 


Q1. Since the base-emitter voltage of Q2 is the same 


as that of QI, it follows that its collector current 


Fig. 3-1: The Widlar 
current mirror. 


should be the same as that of Q1 and, therefore I2=I1. 
Well, not so fast. There are errors, two of them. The first one 


Collector Voltage Q2/V 


Fig. 3-2: The current of Q2 depends 
on its collector voltage. 


Edition February 2005 


5 


1Vidiv 


concerns the base currents. I1 splits into 
three paths: the collector current of QI 
and the two base currents. Assuming a 
minimum current gain of 100, each base 
current amounts to 1% of the collector 
current, for a total of 2%. So the collector 
currents of Q1 and Q2 are 2% smaller 
than I1, worst-case. 

The two transistors may be 
identical, but they are not necessarily 
operated identically, which is error 
number two. The collector voltage of Q1 
is always VBE, but the collector voltage 
of Q2 may be anything. As 
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The Current Source 


All current mirrors start with a current source, from which one or 
more currents are derived. For ICs, a current mirror is a more basic 
element than a current source, which is the reason they are discussed 
first. 


However, be aware that there is a significant difference between a 


theoretical current source (as in a simulation) and a practical one. Ina 
simulation a current source will do anything to keep its programmed 
current level, including building up thousands of volts. In an actual 
circuit the supply voltage limits the excursion. 

Also, little distinction is usually made between a current source 
and a current sink (e.g. I1 in figure 3-3) . For convenience all of them 
are usually termed current sources. 


we have seen in chapter 1, the gain is affected by the collector voltage (the 
Early effect), increasing as the collector voltage is increased. Thus [2 is not 
exactly steady. For this particular transistor (made in a process capable of 
20 Volts) the change amounts to 8% from 0.3V (the saturation voltage of 
Q2) to 5 Volts. (11 is 50uA for all examples in this chapter). 

This is the most simple current mirror and, as we shall see in figures 
3-7 and 3-9, we can improve its performance considerably with additional 
devices. There is also a lateral-PNP equivalent. Using a 
split collector (see figure 1-17), this current mirror needs 
only a single device. Each collector being smaller, the 


+V 


a maximum current is more limited (depending on the 
process, about 100uA). 
Ie 1 l2 The voltage 
sou dependence of a PNP 
current mirror is 
ale sus_|_ generally a bit worse 


12/pA 


than that of an NPN 
Fig. 3-3: Current design (here about 
mirror with 
lateral PNP. 12% change). The 
voltage of the second 
PNP collector can move to within | | | | | | 
about 0.3 Volts of +V. If you let it go Voie iaiesweaen! ae 
any higher (or disconnect the collector 
completely), you get a substrate current 
about equal to [1. 


a FB FB Ba a aaa om 


Fig. 3-4: Voltage dependence of I2. 
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The current mirror also works with MOS 
devices, but it is not quite correct to call M1 a "diode- 
connected transistor" (there is no "junction" diode). 

The 
change in current 
is only about 1.5% a‘ 
from | to 3 Volts 
(0.35u process), 
but only because 

Fig. 3-5: Simple the channel 4 
MOS current mirror. lengths were made 
quite large. It takes at least 0.5 Volts at 
the drain of M2 to make the mirror 
work, a figure which you can improve 
by making the devices much wider. The Fg. 36: _ The voltage dependence 
: : : of an MOS current mirror can be 
mirror can be inverted by using p- made smaller by increasing 
channel devices. channel length. 

Now let's see how we can 
improve the performance of the basic Widlar circuit (not that we are any 
smarter than Widlar, but we have had a long time to work on it and have 
much better tools now). 
+V A first step is to place resistors in the emitters 
2 (or the sources in case of MOS). With 6kOhm in the 
ig example here we drop 300mV across the resistors. If 
the current in Q2 wants to be higher than I1, it would 


also cause a higher voltage drop across R2. This latter 
Qt Q2 increase forces I2 back to where it is more or less equal 


5 


12/ pA 


0 0.5 1 15 2 25 3 


Drain Voltage M2/V 500mV/div 


to Il. There 
Aq R2 is, however 
6k 6k : 5 
still the base Pe 
FOsus 
—— — current error, 49. 
Fig. 3-7: Improved — Which is not 49. 
current mirror with improved. a 
emitter resistors. = 4 
There By 


is a penalty: The voltage at the 
collector of Q2 cannot go any 


lower than the voltage drop 48. 
across R2 plus the saturation b 1 2 3 4 4 
voltage of Q2, about 600mV total Collector Voltage Q2/V 1V/div 


for this case. From this point to 
the supply voltage (5 Volts) I2 


Fig. 3-8: Voltage dependence is now 
reduced to 0.7%. 
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changes only about 0.7%. A measure of quality of a current source is its 
output impedance, i.e. the change in voltage divided by the change in 
current. This has now increased from 1.1MOhm for the original current 
mirror to 12MOhm. 

If you want to use emitter resistors in the PNP equivalent, you will 
need to use two separate transistors rather than a split collector. 


—#-++V An even greater improvement can be made with the 
2 addition of a transistor. This circuit, invented by George 
\er Wilson, is naturally called the Wilson Current Mirror 
Sou (analog designers don't get Nobel prizes, they get a circuit 


named after them). Q3 acts as a cascode stage; its sole job 
r—Kas is to shield the important matching transistors, Q1 and Q2 


from any 
ar iS fluctuation in the a 
output voltage. It aie 
tOsus does this job and 49, 
= = more: by a happy 49.4 
Fig. 3-9: coincidence the 3 04 
Wilson current three base currents a 
mirror. ; mn 
cancel and I2 is now Pa 
within about 1% of I1 and changes . 
9.2 
only about 0.09% over the useful 


0 1 2 3 4 5 


voltage range (an output impedance 
of 90MOhm). Collector Voltage Q3/V 1V/div 
Note, however, that the useful 
voltage range stops at a little over 1 
Volt, given by the 
VBE of Q2 plus the saturation voltage of Q3. 
a Naturally there is a PNP equivalent for the Wilson 
current mirror. We can again use a split-collector device 
a2 for Q1. The output voltage can go to within about | Volt of 
+V (at room temperature). Here the improvement is not 


Fig. 3-10: Performance of the Wilson 
current mirror. 


V 


| 7 quite as good (the output current changes by about 0.5%). 
| There is still a systematic error in the basic Wilson 
Fig. 3-11: current mirror: The two transistors intended to match don't 


PNP Wilson have the same collector voltages; one is at VBE, the other 
current mirror. : . : 
at 2 VBE. In the relentless pursuit of perfection, given at 
birth to all analog designers, we shall now proceed to eliminate it. Enter a 
fourth transistor. 
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Fig. 3-12: Four- 
transistor mirror. 


Chapter 3: Current Mirrors 


The only purpose 
of Q4 is to lower the 
collector voltage of Q1 to 
the same level as that of 
Q2. With this [2 is now 
within 0.6% of I1 and 
changes by less than 
0.08% with voltage. 

The single sweep 
in this DC analysis is, 
however, deceiving. The a 
depicted curve can only 
be observed once in a 


1(Q3-C) / pA 


05 1°15 2 25 3 35 4 45 5 


500nV/div 


Fig. 3-13: Performance of the 
four-transistor current mirror. 


great while, when all four devices 


12/pA 


match perfectly. Only a Monte 
Carlo analysis can tell you what 
really will happen in production. 
The remarkably small change with 
output voltage is a fact, but the 
output current will vary by + 3% 


because of mismatch. 


Collector Voltage Q3/V 500mV/div 


Fig. 3-14: Although the output current now 
changes little with voltage, there is still 
considerable variation due to mismatch, as 
a Monte Carlo analysis will show. 


Current mirrors need not 
be restricted to 1:1 relationship 
between input and output current. 
If the critical transistor on the 
output side is increased in size, its 


collector current is increased too. 
In a bipolar transistor the current ratio is 
determined by the size of the emitter 
(more precisely, the active emitter length; 
see chapter 1) but a accurate ratio is in 
practice only achieved if you work with a 


a 
12=11/3 


Is" | 


Fig. 3-15: 1:3 
current ratio. 


Qt a2 number of identical emitters. In figure 3- 
15 QI has one emitter while Q2 has three 

TOsus_|_ (they can all be in the same base), 
Fig. 3-16: 3:1 resulting in a current which is three times 


current ratio. 


that of I1. In figure 3-16 Q1 has three 


emitters and QI one, which causes I2 to have one-third the value of 11. Any 
ratio is possible (such as 3:2 or 5:3). In a CMOS design the ratio can be 
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obtained simply by varying the channel width of one of ——————#-4V 
the transistors, but best matching is achieved by using I2=3xl1 
identical, multiple devices. \s . 


This scheme can be expanded to creating 


multiple currents (i.e. additional transistors with their "" if 

bases and emitters connected in parallel to those of Q2, | | bt 

but their collectors separate) in any ratio you desire. ees 
M=3 


But, in bipolar circuits, there is a limit: the base current 
for each additional emitter is supplied by I1. Thus, with 
Q2 having three emitters (or two additional transistors), 
the systematic errors (with a minimum gain of 100) is 
4%; with 9 additional emitters this increases to 10%. 
There is a solution to this (have you noticed, 
|" 2-10xI1__ there is always a solution, it just takes one or a few 
additional transistors). Here, with the help of Q3, the 
base current for QI and Q2 is supplied not from I1 but 


Fig. 3-17: MOS 
1:3 current ratio. 


Vv 


at ae from the positive supply, thus the base current error is 
"0 divided by the gain of Q3. In this way you can not only 
. os create large current ratios but also drive a substantial 


number of separate transistors. In Fig. 3-18 emitter 


SUB 


- = resistors are used to get less of a change in I2 with a 
Fig. 3-18: an varying output voltage (0.7% from 0.7V to 5V); if you 

soon. have 10 separate transistors, they all get 6kOhm in the 
transistor supplies : : Se ak uae 
the base current, | ©Mitter; if the current is simply multiplied by 10, R2 has 

one-tenth the value of R1. Remember that best matching 

is achieved if the resistors consist of identical sections, i.e. you create a 
basic 600 Ohm device and use one for Q2 and 10 in series for Q1. 

If you are thinking of turning [1 on and off rapidly, be aware that 
this circuit is very slow to turn off; there is no discharge path for the bases 
of Q1 and Q2. A resistor (or another current sink) from these bases to 
ground helps to speed up the turn-off T e n—+V 
time. 


The base current problem does |S uy | 
not exist with CMOS devices, they | 


require no input current. Here you are i ‘ad ie |» F ' 
free to add as many dependent current 7 (= | Ee 
sinks as you desire - if the change in Wede “a wept wept 
current with output voltage doesn't et a 
bother you. If this voltage dependence is . = = 7 
too large, you have the choice of Fig. 3-19: Multiple current mirrors 
increasing the gate lengths, adding in MOS. 
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resistors in all sources or - you guessed it - add a few devices. 
To reduce the influence of the output voltage we could use the 
Wilson current mirror, as discussed above. But 


MOS devices cannot take advantage of one of its 
features, the cancellation of base currents. For jo: 2 3 
this reason the Wilson current mirror is not the a ieee 
best choice for MOS, a simple cascode stage has | | 
slightly better performance and can be made to Sa weiop | Wet0h 
. . L=0.35u L=0.35u 
have a wider range in the output voltage. ' ‘ 
Here M3 and MS simply shield M2 and ‘| ni It 
M4 from changes in the output voltage. Their te we wet 
gates are held at a voltage slightly higher thanthe =F" SS) StL 
threshold voltage of M1 by causing a voltage - - 
drop in R1. In our case here this bias voltage Fig. 3-20: Current mirror with 
is 500mV, which results in quite a remarkable cascode transistors. 
49. 50. 
49.8 50. 
49. 5 
49.7 49. 
q < 49 
S497 = 
i] 2 49. 
49.6 49. 
49. 4 
49.5 48. 
48. 
0.5 1 1.5 2 2.5 0.5 1 1.5 2 2.5 
Drain Voltage/V 500mV /div Drain Voltage soomvicly 
Fig. 3-21: Performance of cascode Fig. 3-22: Figure 9-21 repeated with Monte 
MOS current mirror. Carlo variations. 


performance, but requires at least 0.7 Volts at the output. Lowering the 
voltage drop across R1 lowers this minimum output voltage, but increases 
the voltage dependence, which you can reduce again by using even larger 
devices. 

Again, don't get carried away by the impressive performance shown 
with a single sweep, which assumes perfect matching. A Monte Carlo run 
will show you the true behavior. 

For CMOS Current mirrors there are three more sophisticated 
schemes. Figure 3-23 is the one you frequently see in articles. M1 is a thin 
device, producing a bias voltage about 100 to 200mV higher than the gate 
voltages of M3 and MS. Since the gates of M2 and M4 are connected to 
this point, these two devices act as cascodes, i.e. they shield the lower two 
devices from voltage changes. 
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M1 M2 


L=2u  L=2u 
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Fig. 3-23: Widely 


used mirror. 


With the dimensions shown, 
the circuit in figure 3-24 has exactly 
the same performance with fewer 
devices and less current 
consumption; here the lower devices 
are dimensioned so that the gate 
voltage (at 50uA) has the required 
value for cascode biasing. The upper 
devices are then made wide enough 
to leave a comfortable margin 
between their source potentials and 
the "on" voltage of the lower 
devices (i.e. the voltage drop 


caused by channel resistance). This is a prime example 
how much you can do by simply changing the channel 
length and width of a CMOS transistor. 

The circuit in figure 3-25 has the best performance. 
The cascode bias voltage is set not only by the device 
dimensions, but by the small (250mV) voltage drop across 
R1. Since the current itself is almost certainly determined 
by a resistor, R1 will track it. The flatter curve represents a 


Vv 
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~We-tou W-t00 


L=5u L=5u 


Fig. 3-24: Fewer 


devices, same 
performance. 


we higher output impedance (100MOhm for 3-25, 33MOhm for 
ona: we BG the others) which is oe 
a ri pea eats high 4-28 
L gain in active : 
Ye vat oads. ss a 
All the a 
bea figures given here i 
Best x 3}25 
Performance. are for room = 
temperature only. — 
The threshold voltage, the resistor 4.97 
and, in bipolar designs, the VBE 408 
have temperature coefficients. 49.95 
Make sure you simulate your os 1 12 14 16 18 2 22 24 26 28 


circuit over the entire temperature 


range. 


Drain Voltage/V 


200mV/div 


Fig. 3-26: Comparison of Performance. 
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4 The Royal Differential Pair 


Open any analog IC and you will find a differential pair. Or, more 
likely, a half dozen. It has great advantages, even if amplifying a 
"difference" is not even a goal. 

The reasoning is simple: Individual integrated components have 
large variations, but two (or more) of the same match very well. If you can 
take advantage of the matching, you get better performance. 

It isn't always true, of course. Noise, for example can be smaller in 
a single-transistor stage and some of the most ingenious designs are 
remarkably free of the common differential stage. But let's look at this 
wondrous tool. 

. Two transistors - here bipolar - share a 
common emitter current. If the voltages at 

2 3 their bases are equal and the two transistors 

veo Match perfectly, I1 is split into two equal 
—Kar a2 ©); parts at the collectors, [2 and I3. 

If we increase V1 (relative to V2), Q1 
gets more of the current than Q2. If we 
© : \8 , Oa j=sus_ decrease V1, the opposite is true. 

But there are limitations and errors. 
First of all, the current division (or input 

Fig. 4-1: In adifferential pair | VOltage to output current relationship) is not 
a current is divided by two linear. We are dealing with two base-emitter 
italisislats: diodes here, fundamentally exponential 
devices. Not counting stray effects the emitter resistance is: 


_ k*eT 
~ gk ie 


e 


where k = Boltzman constant (1.38E-23 Joules/Kelvin) 
T = the absolute temperature in Kelvin 
q = the electron charge (1.6E-19 Coulombs) 
Ie = the operating current through each emitter 


This expression amounts to about 26 Ohms, at room temperature 
and with a current of ImA. If Ie drops to 100uA, re becomes 260 Ohms. 
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Since it is very much a function of current, r, is called the dynamic emitter 
resistance. The conversion from base voltage to collector current, the 


transconductance, is 


where R, is the ohmic resistance of 
the emitter, i.e. the resistance 
between the emitter contact and the 
emitter-base junction (usually a 
few Ohms). 

As the current moves from 
one transistor to the other, both 
emitter resistances change and we 
get a rather non-linear behavior. 
Only a small portion of the curve 
in the middle, when the two current 
are equal or nearly equal, could be 
called linear, though in truth it too 
is not a straight line. 


-50C 125C 


12/ pA 
=> np wo sb a for) Nn a o 


18 #185 19 1.95 2 2.05 2.1 215 2.2 


Input Voltage/V 50mV/div 


Fig. 4-3: ...and also temperature- 
dependent. 


HA 
=> y © FB a ®@N ww 


V1V/V 50mV/div 


Fig. 4-2: The conversion from input 
voltage to current (the transconductance) 
is non-linear..... 


The other variable in the 
equation is temperature. The emitter 
resistance is proportional to absolute 
temperature, so at high temperature 
you get less gain or transconductance. 

There are also three sources of 
error to be considered: 1. A small 
portion of the emitter current comes 
from the bases, not the collectors. 
With a minimum hFE of 100 this 
makes the sum of the collector current 
smaller than the emitter current by 
1%, 2. Transistors only match well if 
the are treated identically. In this case 
that specifically means the collector 


voltages have to be the same. If one is higher than the other, its transistor 
will have a higher gain because of the Early effect; 3. Devices never match 
perfectly; there will be some differences in both VBE and hFE and thus 
some uncertainty in the voltage at which [2 and I3 are equal, showing up as 


an offset voltage. 
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t A differential pair using MOS 
transistors behaves almost identically, but 
i2 3 ei for entirely different reasons. There is no 
dynamic emitter resistance; the gain is 
wi a. determined directly by the trans- 
L=0.35 L+0.35u conductance. This transconductance is also 
Ow | Q ; eu leer increasing drastically with 
g gate voltage. Whereas in the 
ane gis eee | bipolar transistor size is only of second- 
Fig. 4-4: MOS differential pair. order importance, transconductance in an 
MOS transistor is directly proportional to 
gate width and decreases with increasing temperature. 


125C| 


4 
s £ 
Rid 
= 

1.8 1.85 1.9 1.95 2 2.05 21 2.15 2.2 18 7.85 19 1.95 2 2.05 24 245 22 

vay Sova viv 50mV/div 

Fig. 4-5: A CMOS differential pair is Fig. 4-6: ...and the transconductance 
also non-linear ..... also has a temperature coefficient. 


Notably absent in the error sources is any kind of input current; [2 
plus I3 are indeed equal to I1. There is, however, an offset voltage and, for 
equal sizes, MOS transistor have a larger offset voltage (1.e. mismatch) than 
bipolar ones (about 2:1, but this depends 
greatly on the process). Remember you 
can always improve matching (for any 
device) by increasing size (i.e. total area), 
preferably by using multiple small 
devices. 

Let's get back to bipolar transistors 
and complete the differential stage. We 
can simply use the two collector currents 
to create voltages across two resistors. In 
Fig. 4-7: A complete differential this example the voltage at the base of Q2 

amplifier. is held constant - it is simply a DC bias 


2VvDC, 


50mVpAC 
“Ke 
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voltage large enough to overcome the base-emitter diode voltage (assuming 
that there is a single 5-Volt supply). Vin, going to the other base, carries 
the same DC bias level and has a 50mVp AC signal superimposed (i.e. it 
moves from 1.95 Volts to 2.05 Volts). The gain of this stage is determined 
by the ratio of the resistors to the dynamic emitter resistances. At 50uA (for 
each transistor) r. is 520 Ohms (26Ohmsx1mA/50uA). The gain from the 
inputs (measured differentially, which in this case is simply Vin) to the 
outputs (again measured differentially, i.e. Out] + Out2) is 80k/1.04k or 77. 
The gain to only one output is half of that. 

It's not a great deal of gain and it cannot be made any larger by 
simply increasing the values of Rl and R2. There is a DC voltage drop of 2 
Volts across them, if we were to double their values the transistors would 
saturate. 

In reality the gain is always lower than obtained by this simple 
calculation, which does not take into account the ohmic (i.e. access) 
resistances in the emitters or the fact that a small percentage of the emitter 
current is lost to the base. 

Even with only 5|0mVp ai ae 
input there is already significant Pa teak 
distortion, about 5%. We can improve ne ‘ 4 Joute 
this by connecting resistors in series with — somvpac oe 
each ae This makes the total me 7 © 
emitter resistance more linear, which vin 5 3 R4 | tas 
drops the distortion to less than 0.1% oe ~ testis 
(with 50mVp input). But the gain has | i . 
suffered badly: less than 16 with a = a SS 
differential output, less than 8 single- 
ended. 


Fig. 4-8: Linearized differential 
amplifier 
‘ There is a competing approach: 
40k agi two separate emitter current sources (or, 
more precisely, current sinks) of half the 
Phen es value and a single resistor of twice the 
value connected between the emitters. If 
ak you take a poll among analog IC designers 
Ci |S I |S @ about half of them will swear that one is 
eau sou ] ? better than the other. But in fact the two 
= = = = = circuits are identical in performance. 
Fig. 4-9: A connection different To get more gain we need a better 
from that of figure 4-8 but identical | scheme for the output. In the vast 
in performance. majority of applications an amplifier 
needs only one output. Thus it is no loss 
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if we convert the differential signal to a single-ended one in the very first 
stage. And, with a current mirror, the benefits are immediate. 

If one NPN collector current is 
mirrored by Q3 (here a split-collector 
lateral PNP) it opposes the current of the 
second collector. With no input signal 
(and perfect matching) the two are equal, 
they cancel each other. But with an input 
signal one increases, while the other one 
decreases (hopefully by the same 
amount), so that we only see their 
difference at the output and we can use a 
much larger output resistance. As drawn, 
the gain of this stage is 278. And since 
the gain is so large, we get quite a large 
output signal with only 1mVp at the 
input, which makes the distortion 
reasonably small. Q3 is called an 
active load. 3 

But there are two things wrong 
with the circuit in figure 4-10. First, if 
you were to specify a 250kOhm - 
resistor in an IC you might be 
suspected of lunacy; its size would oe es 
take up more space than all the other Fig. 4-11: Output signal with a 
components together. Second, if you 1mVp sine-wave input. 
look at the output waveform closely, you 
notice that there is a DC current flowing through R1. At the right end we 
connect it to a 3-Volt bias point, but at the left end the center of the sine- 
wave is not 3 but 3.25 Volts, i.e. we have a built-in offset. There are two 
reasons for this: |. the two collectors of Q3 are not at the same potential and 
2. the collector current of Q1 has to supply the base current for Q3. 

We need something like R1 to fix the DC potential at the output. 
The two opposing collectors are current sources/sinks. The smallest 
difference between the two would cause the output potential to move up so 
much that Q3 would saturate or down so much that Q2 would saturate. 

In figure 4-12 all of these problems are fixed at once by adding a 
second stage. Q4 is the same size as Q3 and is operated at the same current. 
Now the collector voltages of both Q1/Q2 and Q3 are identical. Moreover, 
the collector current of Q2 has to supply the same amount of base current 


Fig. 4-10: Differential amplifier with 
an active load. 


3.! 


3.4 


Out/V 


0 02 #04 O06 08 4 1244 16 18 2 
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(for Q4) as the collector of Q1 does. In other words, with no input signal 
the circuit is perfectly balanced; there is no built-in offset. 

T But you need to be careful here. 
Ke lve The gain of this circuit is no longer fixed 

}_____k 0% by a resistor ratio, it is dependent on 
transistor parameters. If these two stages 
—Kar a2}— ica are made part of an operational amplifier, 
the feedback will take care of this. Or, if 

Vint 


©" |B a on le ce L the circuit is used merely as a comparator, 
2 °v8 gain is of lesser importance than offset. 


Fig. 4-12: High-gain, balanced 

differential amplifier. 

If you simulate the circuit 
as shown, you will get a different 3. 
and rather odd output curve. 
Current sources in a simulator are 
ideal devices, they will do anything 
to supply the exact amount of 
current, which includes supplying jue toe eee ee 
their own voltage (if necessary smite miata 
thousands of volts). An actual 
current sink such as I2 will 
collapse near ground, but the ideal 


Out /V 


Fig. 4-13: Transfer curve for circuit in 
figure 4-12. 


one keeps right on working 
; 4 down to a very large 

4. negative voltage. For this 
| reason there was an 
| additional device in the 
3. simulation diagram, a diode 
from Vin2 to Out which 
clamps the output swing at 
2. the low end. 

The last circuit may 
be perfectly balanced but, as 
in any circuit, the matching 
of the devices is still subject 
to variation. If we run a 

Fig. 4-14: Even a perfectly balanced differential Monte Carlo analysis we 
amplifier (figure 4-12) has an offset voltage due to . 
meet the real world: the 


mismatch. 
random offset voltage. 


Out / V 


1.999 1.9992 1.9994 1.9996 1.9998 2 2.0002 2.0004 2.0006 2.0008 


Vint/V 200pV/div 


Edition February 2005 4-6 All rights reserved 


Camenzind: Designing Analog Chips Chapter 4: The Royal Differential Pair 


We wat t Figure 4-15 shows the 

| Ms ve same design in CMOS. Note that 
ene Ges () M3, M4 and MS are all the same 
as size, thus balance is achieved with 
- ‘ie 12 having a magnitude of one-half 
| = | Il. Here, of course, we are not 
wa10i_| =i concermed about cancellation of 


C) | oa O), Ie" |. base currents but identical gate 


voltages are still important. 

The random offset voltage 
is of greater concern in a CMOS 
design. MOS transistors match 
less well than bipolar ones. That 
has been true since the start of the IC industry. It is not that an MOS 
transistor is inherently inferior in this respect, but that matching, 
specifically the offset voltage is based on different process parameters. For 
the bipolar transistor matching is determined by the depths of diffusions, 
particularly the base and emitter. The dimension having the greatest 
influence on offset voltage in an MOS transistor is the gate insulator 
thickness. While control has steadily increased, the insulator thickness 
needed to be steadily decreased to get sufficient gain for the ever smaller 
devices. The gate insulator thickness has by far the smallest dimension in 
an IC and thus continues to create fluctuations in threshold voltage larger 
than a diffusion will cause in VBE. 


Fig. 4-15: Balanced CMOS differential 
amplifier. 


Two more additions. The base current of a bipolar transistor is a 
disadvantage. If its operating current is say 25uA and the minimum hFE 
100, the input draws (or supplies in the case of a PNP transistor) as much as 
250nA. We can decrease this with a 


Darlington configuration. T 
Q3 and Q4 carry the base current _ : 
of the differential pair. At their bases the 
input current is reduced by a factor of jas yor Wg 
another hFE. Thus the input current is Ph 4 ° 
lin = [1/2(hFE)? me il cal 
or 2.5nA. There is a price, though: Cc 4 r " a 
1. The input voltages need to be 100u 2. i: 


higher by a VBE so that there is enough 7 = + 
headroom for I1; 

2. Q3 and Q4 run at very low 
current, thus their speed is bound to be 


Fig. 4-16: NPN Darlington input 
stage. 
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rather slow; and 

3. the leakage currents of Q3 and Q4 run into the bases of Q1 and 
Q2, showing up multiplied by the hFE of the latter two in I2 and [3. This is 
a danger at high temperatures (say above 90°C). 

Switching time and leakage current can be reduced with small 
currents from the emitters of Q3 and Q4 to ground, in effect running the two 
transistors at a higher operating current. But of course you can't go too far 

in that direction: the input current 
vee increases again. 

\Q™ If you invert the circuit and use 
(lateral) PNP transistors at the input, 
you gain an advantage which is often 


J useful: one input can be at ground. 
| ae uM - There is enough headroom for the 
ae oe current mirror (Q5 and Q6) even if the 
a input is 200 or 300mV below ground. 
*OsuB The limit here is one diode drop below 


== === ground, at which point the base of Q3 

Fig. 4-17: A PNP Darlington input will forward-bias against the substrate. 
stage allows the input to move below The same limitation in speed and upper 
ground. temperature apply. 


The Darlington Pair 


Sidney Darlington was born in Pittsburgh, Pennsylvania, in1906 and joined 
Bell Laboratories in 1929, where he remained until his retirement 42 years later. He 
was a theorist who also liked to tinker with circuits. 

In 1952 silicon transistors made at Bell Labs had low gain (5 to 15). 
Darlington checked out two of them (only a few were available) and experimented at 
home over a weekend. He found that by connecting the emitter of one to the base of 
the other, the gain would be the product of the two, 25 to 225, a much more useful 
range. He then suggested a method fabricating the pair out of a single block of 
silicon (with a common collector), thus coming very close to the idea of a monolithic 
integrated circuit. Bell Labs was issued a patent (2,663,806) in 1953. 

Darlington died in 1997 at age 91. 
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5 Current Sources 


Ever since the dawn of analog IC design (all the way back in 1962) 
a succession of very clever people have been trying to conjure up 
something that would produce an accurate current. The results have been 
uniformly dismal. 

There happens to be a capable voltage source in ICs, the bandgap 
reference (which we shall get into next). So, to get a current, one would 
think, all one needs is an accurate resistor; after all I= V/R. But, unless you 
want to add a costly thin-film layer and laser trimming, there are no 
accurate resistors. What we get are resistors made from diffused or 
deposited silicon layers which vary in resistance from wafer to wafer and 
have a considerable temperature coefficient. 

So, don't expect any precision here. At best, an integrated current 
source can provide a small current without the use of large-value resistors 
and make this current more or less independent of the applied voltages. 


Current Sources with Bipolar Transistors 


The first example uses a diode-connected — 
transistor (Q3) as a reference voltage. A primary c i 
current flow through R2, Q2 and Q3. The base of Q1 is 
at two VBE (base-emitter or diode voltage), thus its | 
emitter has a potential of one VBE. The current aa} + Kar 

through Q1 is thus C1 Se 
2 VBE/RI. Ifwelet a3 $ 4ox 
the voltage at the *OsuB 
4 collector of Q1 = = 
2 (the destination of Fig. 5-1: Current 
=! the current) drop ETE me 
below about | 
Volt, QI saturates 
and draws from the primary current. 


0 


But above 1 Volt the current is very 


Collector Voltage Q1/V 1V/div 


constant, changing less than 0.3%. This 


Fig. 5-2: 11 vs. output voltage. Page 
. P ? quality is best expressed as output 


Edition February 2005 5-1 All rights reserved 


Camenzind: Designing Analog Chips Chapter 5: Current Sources 


impedance, i.e. the change in voltage (4 Volts) divided by the change in 
current (about 53nA). Thus the output impedance of this circuit is about 
75MOhm. Not bad for using only 80kOhms in resistance. 

A diode has a negative temperature coefficient and a (diffused) 
resistor a positive one. The two combine to give the current a strong 
negative temperature coefficient, a 
change of about -29% from 0 to 
100°C. 

The VBE is of course no Zener 
diode, it varies a bit as the current 
changes, which makes the current 
dependent on the supply voltage (a 
+2% increase as the voltage moves 


1 
1 
1 
1 
1 


I1/ pA 


from 4.5 to 5.5V). \ | | | | 
And then there is the variation Supply Votage/V tviaiv 

in production: + 28%, mostly caused 

by the variation of R1. Changes with Fig. 5-3: 11 vs. supply voltage. 

temperature and supply voltage must 

be added to this figure. 


Also be aware that we are wasting some current: it takes 90UuA 
through R2 to produce 20uA in Q1. 


A word about the choices in the examples of this chapter: 

e For bipolar circuits the use of (base) diffused resistors is assumed 
with an absolute variation of + 25%. This is probably the largest variation 
you will encounter; CMOS foundries often guarantee smaller variations, 
especially for poly resistors. 

e Each current source produces about 20uA, an arbitrary choice 
made to allow comparison. 

e Strictly speaking these circuits are current sinks, not sources. To 
make a circuit in which the current is delivered from the positive supply, the 
design is turned upside-down, NPN transistors are made PNP and N- 
Channel ones P-Channel. 

e Supply voltages are arbitrarily selected from 5, 3 and 1.8 Volts. 

e As before, the fourth (bipolar) transistor terminal is hidden; all of 
these terminals are connected together to the most negative supply voltage 
with the symbol SUB. This avoids cluttering up the schematic. For MOS 
transistors the connection is left visible as a choice needs to be made for the 
P-Channel device. 
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On to the second example, a rare case where better 


eee performance is achieved with fewer devices. Through 
Rt oy feedback Q2 regulates the current of Q1, holding I1 more 
oe constant. In this way the output impedance increases to 
< 500MOhm, the Monte Carlo variation decreases to + 26% 
and the change with supply voltage from 4.5 to 5.5V to 
a2>}—+ +1.8%. But the voltage at the collector of Q1 still must not 
2 drop below about 1 Volt. 
fesua 7“ In both of these current sources the emitter of the 
— eS output transistor is sitting on top of one VBE, about 0.65V 
Fig. 5-4: at room temperature (and higher at low temperature). For 


Improved VBE 


current source.  10W-Vvoltage ICs we need a design in which this emitter is 


at or very near ground. 


4 
4 
0 1 2 3 4 0 1 2 3 4 5 
Collector Voltage Q1/V 1Vidiv Supply Voltage/V 1Vidiv 
Fig. 5-5: 11 vs. output voltage of figure 5-4. Fig. 5-6: 11 vs. supply voltage of figure 5-4 
Figure 5-7 looks like a current mirror, but it isn't. ve 
There is a deliberate mismatch between the two 
transistors. They get the same voltage at their bases but, ie 
while Q2 has a straightforward base-emitter diode to 
ground, the path for Q1 consists of a lower diode voltage go at 
(because of the larger area using three emitters) and a 
resistor. The difference in voltage between the two RI 
diodes is: sus? 3.75k 
k*T Al* 12 Fig. 5-7: Delta-VBE 
deltaVBE = *In current source. 
A2* I] 


where k = Boltzman constant (1.38E-23 Joules/Kelvin) 
T = the absolute temperature in Kelvin 
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q = the electron charge (1.6E-19 Coulombs) 
Al = emitter area of Q1 
A2 = emitter area of Q2 
12 = current through Q2 


k*T/q amounts to about 26mV at room temperature and [2 is about 110uA. 
Thus, with a desired I] of 20uA, the voltage drop across R1 is 
26mV*1n(16.5), i.e. 11 = 72.9mV/3.75k = 19.4uA. Note that delta-VBE is 
independent of current, only the current ratio is important. 


1 /uA 


° 1 2 3 4 0 1 2 3 4 5 


Collector Voltage Q1/V 1V/div Supply Voltage/V 4Vidiv 


Fig. 5-8: 11 vs. output voltage of figure 5-7. Fig. 5-9: 11 vs. supply voltage of figure 5-7. 


The voltage at the collector of Q1 can now go lower, to the 
saturation voltage of the device plus the delta-VBE. There is little change 
in current as the voltage at the output is moved (amounting to an impedance 
of about 12MOhm) but dependence on supply voltage is quite large, a 
+6.5% change as Vcc moves from 4.5 to 5.5V. 

Since delta-VBE is proportional to absolute temperature (PTAT), I1 
has a marked positive temperature coefficient, moderated only slightly by 
the positive tempco of the resistors. Production variation (at a fixed voltage 
and temperature) is + 26%, dominated by the variation of R1. 

The performance can be improved slightly by using a larger device 
ratio. With Q1 having 10 emitters the output impedance increases to 
15MOhms and the voltage dependence to 4.5% (4.5 to 5.5V). 


The Quality of a Current Source 

An ideal current source maintains the current level no matter what happens 
at its terminals, which results in an impedance that is infinite. 

A practical current source can approach this over a limited voltage range, 


with an impedance of up to tens of Meg-Ohms (i.e. there is very little change in 
current as the voltage across the current source changes. But its absolute level is 
subject to (absolute) parameter variations, which are large in an integrated circuit. 
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You can also improve this performance with a 
simple measure. Suppose you connect the base of Q1 
not to the base of Q2, but to a point which 
counteracts a rising Vcc. By inserting a small 
amount of resistance in the collector path of Q2 we 
get a node whose voltage is fairly constant. The 
voltage at the base of Q2 still increases somewhat as 
Vcc is increased, causing its collector current to 
increase. But this makes the voltage drop across R3 
increase and, with just the right value, the base 

aA 7 voltage for Q1 changes little, at least over the critical 
Fig. 5-10: R3 reduces : 
supply-voltage range in supply voltage. 
dependence. Note 
that, because 
of the lower base voltage, the value 
of R1 is lower for the same amount 
of current. The easiest approach to 
circuits like Figure 5-10 is 
simulation. Just try various values 
for R1 and R3 until you get the right 
current with minimal change. But , 


it / pA 


in the layout, make these resistors | a ee ee ee ee ee me 

fairly wide; you are counting on ghpaiy Volsaen sum av 

matching. Fig. 5-11: 11 vs. supply voltage with R3 
The change in I] is now a mere optimized for the range 3 to 3.6 Volts. 


+0.5% with Vcc varying between 3 
and 3.6V (and even lower with a 4.5 to 5.5V range). 

The temperature coefficient for this circuit is somewhat larger: a 
+31% change from 0 to 100°C and the output impedance drops to about 
7TMOhms. Production variation is unchanged. 


We are about to take a rather daring step. The primary current in the 
previous circuits is a nuisance; it wastes power and takes up considerable 
resistance. Why not replace it with a current source derived from the 
current the circuit generates? 

There is one flaw in this argument: the current must exist first. 
There are two possible modes, one in which the current levels are as 
intended and one where there are no currents at all. In other words, there 
must be a current in Q2, which can be mirrored and fed back to QI and the 
base of Q2 so Q2 can have a current, etc. 
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The usual solution is to employ a start-up circuit, designed to bring 
Q2 to a level sufficient to sustain the loop. The start-up circuit then shuts 


down and has no further influence. 


But there is another way: leakage 


Vec 
ea currents. Q2 has two leakage currents, from 
collector to substrate and from collector to 


H base. These currents may be small (pA), but 
By ~ they are mirrored by Q3 and fed back into the 
uals a4 base of Q2, where they are amplified. And so 
a a it goes around the loop, eventually reaching 
R2 R3 microamperes. 
Tsu a eee Two factors must be understood here. 
om mee First, we are not talking about a leakage 
Fig. 5-12: Self-starting current caused by dirt. The very small reverse- 
current source without junction currents measured in today's IC 


large-value resistors. 


nothing to do with cleanliness. 
Second, the design and the 
process must allow these small 
currents to grow. If, for 
example, there is a path from 
the base of Q3 to Vcc or the 
bases of either Q1 or Q2 to 
ground which can shunt 
leakage (provided say by a very 
large, reverse-biased junction), 
the scheme won't work. If your 
models are accurate, trust the 
simulation. Use a Monte Carlo 
analysis to see if the circuit 
starts up every time and do this 
at temperature extremes where 


devices are fundamental phenomena and have 


11 /pA 


t 


0 1 2 3 4 5 


Supply Voltage/V 1V/div 


Fig. 5-13: 11 vs. supply voltage for figure 5-12. 


leakage currents are either at their lowest or highest. 

R1 has been added to counteract the remaining dependence on Vcc 
(caused by the Early effect in Q2 and Q3). With that we get a change of 
+0.4% as Vcc is increased from 4.5 to 5.5V. The circuit can have a supply 
voltage as low as | Volt and the voltage at the output can be as low as 0.3V. 


Lastly, the Erdi current source, a very clever design with an 
astonishing performance. We start with an auxiliary current, aux. And 
before you even have a chance to sneer at the fact that a current source is 
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vec used to make a current source, let me point out that 
the accuracy of this current source is of no great 
importance. A bulk (epi) pinch resistor will do or 
any of the lesser current sources discussed above. 
Qs Taux is mirrored and split into two equal 
parts by QI and Q2; thus the operating currents for 
Q3 and Q4 are equal. Q4, however has 3 emitters, 
Q3 only one, thus there is a difference of about 
29mV (at room temperature). Unbalanced, the 
collector voltage of Q3 rises until Q5 supplies 


Fig. 5-14: Erdi current enough current to make up the difference. This 
source. 


current amounts to (delta- 
VBE)/R2. ' 
Vcc can be as low as | Volt 
(or as high as breakdown voltages vs| Supply Volthge 
allow). Moving Vcc 20% changes Ws. Output Voltage 
I1 by 0.08%. The output 
impedance is 5S0MOhm. 
Temperature is strongly positive, a 
+25% change from 0 to 100°C. 
And the Monte Carlo variation is 
roughly that of R2, here assumed pe ae ae ae he ee 
to be + 25%. —_ —_ 


Fig. 5-15: Performance of the Erdi current 
source. 


1 /pA 


CMOS Current Sources 


None of the bipolar schemes work well for CMOS devices. They 
are based on VBE or delta-VBE , for which there are no equivalents. 
Trying to use circuits such as Figures 5-10 or 5-14 with CMOS width-ratios 

1 leads to inferior circuits. Also, due to the square-law 
behavior of the gate voltage, the variations are roughly 
double those of bipolar designs. 

Fortunately the CMOS transistor is a current 
source. If we simply apply a constant voltage to the 
gate (such as a reference voltage) we can tailor the 
width and length of the device to give us a certain 


= =" _ current. 
Fig. 5-16: MOS Using a rather exaggerated length, we can 
transistor as a minimize the channel-shortening effect. In the example 


current source. 
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here the output current varies little 
with the applied voltage, amounting to 
an impedance of 383MOhm. But the 
variation (due to the uncertainty in the 
threshold voltage) is large: + 39%. 
Add to this the change with 
temperature (0 to 100°C, -23%) and a 
variation in the reference voltage (+ 
3% causes a change of 10% in I1). 


11/pA 


Output Voltage/V 500mV/div 


Fig. 5-17: 11 vs. voltage for fig. 5-16. 


The Ideal Current Source 


Sometimes a compromise is the best solution. If we allow just one 
component to be external to the IC and provide a pin for it, the 


performance of a current source improves dramatically. All other currents 
i within the IC can then be derived from it with 
current mirrors and are thus inherently 


Hi accurate. 
100mv ls In the last circuit an op-amp compares 
W=5) = . . 
CO hal the voltage across an external resistor with a 
1 low internal reference (divided down, for 
a example, from a bandgap reference) and 


drives the gate (or base) of an output device. 
a Assuming no trimming, a 1% tolerance for the 

external resistor, 3% for the reference voltage 
Fig. 5-18: Accurate current ~~ and 2mV offset uncertainty for the op-amp, I1 
source with an op-amp and 


external resistor. will be within 6% at any voltage and any 
temperature. 
Edition February 2005 5-8 
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6 Time Out: Analog Measures 


dB 


Analog scales tend to be very large. As an example, the hearing 
threshold of a young adult is 20 micro-Pascal; the maximum level without 
damaging the ear can be more than 20 Pascal, a ratio of one to 1 million. 
Particularly because of the widely varying sound levels a need for a 
logarithmic measure appeared early on in electronics. There are two of 
them: The Neper is based on the natural logarithm and named after John 
Napier, a 16th century Scottish mathematician who came up with the 
logarithmic table (and whose name was most likely spelled Neper in his 
time). 

In the 1920's a measure based on logarithm with base 10 began to be 
used at Bell Laboratories. At first it was called the "transmission unit", then 
re-christened the Bel, after Alexander Graham Bell. The idea is simple, a 
Bel is the logarithm of two power levels: 


Pi 
Bel = log — 
P2 


But the Bel turned out to be a bit coarse; one-tenth of that suited the 
Bell Labs people better, hence the decibel, or dB: 


dB = 10* log = 
P2 


This is the ratio of two power levels. Since power is related to the 
square of the voltage (or current), we get: 


1 
dB = dle 
v2 


Neper has more or less disappeared as a measure, dB proved to be 
more convenient. But, when using dB, always keep in mind there is a 2:1 
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difference between the ratios of power and those of voltages or currents (or 
pressure). 

Even though a logarithmic ratio is very convenient, it is helpful to 
picture the (voltage) actual ratios: 


-60dB_ 1/1000 
-40dB_ 1/100 
-20dB_ 1/10 
-6dB_ 0.5 (exactly: 0.5012) 
-3dB 0.707 
OdB 1 
20dB 10 
40dB 100 
60dB 1000 


Fundamentally dB is relative, expressing only a ratio. But there are 
many modifications in which one of the two levels are absolute, among 
them: 


dB(SPL) Sound pressure level, where the hearing threshold (0dB) 
is based on 20uPascal. 


dBm Power ratio where 0dB = 1mW. Originally this was based 
on an impedance of 600 Ohms (that of a telephone line), 
but now is used with any impedance (which is fine as long 
as we calculate power ratios, not voltage ratios). 


dBuV Voltage ratio relative to 1uV. 


RMS 


RMS calculation was introduced by Charles Steinmetz more than 
100 years ago. Steinmetz grew up in Breslau, Germany (now Poland), but 
shortly before he got his Ph.D. in mathematics and physics he had to flee to 
Switzerland because of his socialist activities. From there he emigrated to 
the U.S. and found a job as an assistant draftsman in Yonkers. The company 
fabricated hat-making machinery, but soon expanded into electrical motors. 
The year was 1889. 

Steinmetz was a small man with a hunchback and one leg shorter 
than the other, a deformity he inherited from his father. Though he made the 
proverbial bad first impression, the people around him soon were in awe of 
his razor-sharp mind. No surprise then that the overqualified draftsman was 
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at the forefront in AC engineering within four years. Through mergers and 
acquisitions he found himself to be working for General Electric as head of 
the calculating department in Schenectady and teaching at Union College. 
He led a bohemian life; afraid to marry because of his inherited deformity, 
he shared his house with an entire family and kept several crows, raccoons, 
eagles, owls, squirrels, dogs and alligators. He resumed his socialist 
activities, expounding his ideas in a book; he was against competition and 
advocated an industrial reorganization by the government. It is remarkable 
that he got along very well with his bosses at GE. In all respects he was a 
delightful man who seemed to have a very happy life. Almost single- 
handedly he moved electrical engineering from a craft to a profession. 

Steinmetz found that few "electricians" used mathematics, his 
specialty. The first curriculum in electrical engineering had started at MIT 
in 1882 and very few people understood AC. George Prescott lamented in 
1888: "It is a well-known fact that alternating currents do not follow Ohm's 
law, and nobody knows what law they follow." 

For example, there were two things wrong with the "average" value 
meters of the time displayed: in the first place the true average of an AC 
waveform should have been zero; in the second place the product of the 
average current and voltage gave the wrong answer for power. Also, a phase 
shift between voltage and current left almost everyone perplexed. 

In 1893 Steinmetz presented his first paper on the use of complex 
numbers in electrical engineering. It was heavy going, delivered in a thick 
German accent. But he kept at it in paper after paper and then a massive, 
three-volume text book. By 1901 he had it down pat and published a 
textbook that was finally easy to understand. 

What he said was this: In order to calculate the power correctly, you 
need to square the voltage (or current), calculate its mean (average) and 
apply the square root. Hence RMS - root-mean-square. Or a bit more 
detailed: You divide the waveform into equal segments over one period, 
square each segment, add up the squared values, calculate the average and 
take the square root of that. 

The power, by the way, is the average power. There is no such thing 
as RMS power, only RMS voltage and RMS current. 

For a pure sine-wave this works out as: 


_ Vpeak 


Vrms = 0.707* Vpeak 
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Here is a simple illustration of RMS 
calculation: Four time segments of 
100usec each. First a voltage is at 5 
Volts, then a zero, then at 2 Volts and 
finally at zero again. 


Ss 


Voltage / V 


2 2 2 2 r 
Vrms = aaa = 2.69V 


0 50 100 150 200 250 300 350 


Time/uSecs 50uSecs/div 


But the RMS calculation has Fig. 6-1: Arbitrary waveform for RMS 
some limitations: it doesn't work with calculation. 
non-linear elements. In Steinmetz's 
time there were no transistors, not even vacuum tubes. There were only 
linear elements (save perhaps for the occasionally saturating transformer), 
so he didn't consider what would happen if the impedance changes while 
you are measuring RMS voltage and current. 

Take the case of a transistor stage, either linear or switching. You 
want to determine its power dissipation, so you measure the current through 
it and the voltage across it. But the impedance of the transistor constantly 
changes and Ohm's law doesn't hold. The product of RMS voltage and RMS 
current gives an absurdly wrong result for power. The only way you can 
determine the power is to integrate the instantaneous values of voltage times 
current. In a simulation, Spice does this very well. 

But measurements aren't nearly so easy: "True RMS" instruments do 
indeed have a circuit element which measures RMS rather than average. But 
the inputs to almost all of them are capacitively coupled. If the waveform 
you are measuring has a DC component, it is ignored and the result reflect 
on the AC portion of it. 


Noise 


Imagine a current flowing through a wire connected between a 
negative terminal on the left and a positive terminal on the right. Through 
the wire are flowing millions of electrons from left to right. 

Each electron carries a charge of 1.6e-19 Coulombs. Let's say we 
observe a current of 1uA, thus 7.8e12 electrons pass every second. If the 
interval between electron were the same, we would then see a ripple at 
7800Ghz, like the teeth of a saw-blade moving at high speed. 


Edition February 2005 6-4 All rights reserved 


Camenzind: Designing Analog Chips Chapter 6: Analog Measures 


But in a diode or a bipolar transistor, we see quite a different effect. 
Here the current is initiated by electrons and holes moving across a barrier 
and this movement is anything but smooth. Within any given time interval 
one electron or hole may cross the barrier, or 5, or 100, or none. The 
variation is so great that there is no discernable peak at 7800Ghz, or any 
other frequency. In fact, because of the very large number of electrons, this 
white noise or shot noise is so well distributed over the frequency range 
that it has a constant level over the entire spectrum: 


Inoise(rms) = 2-q:I-B 


where q = electron charge (1.6e-19 Coulombs) 
I = de current 
B = bandwidth in Hz 


There are two things you should notice here. First: Noise increases 
as the square-root of bandwidth. Second: When the current is decreased, 
noise becomes a larger fraction of it. 

Let's illustrate the second part. With a bandwidth of 10kHz, 1mA dc 
produces 1.8nA(rms) of noise. That amounts to 0.00017% or -115dB. 

With 1uA of current and the same bandwidth the noise is 56pA(rms), 1.e. 
0.0056% or -85dB. At InA we get 1.8pA of noise, which amounts to 
0.18% or -54dB. All of which shows that it is harder to design a low-noise 
circuit at low current levels. 

There is also noise when no current flows at all. By the energy 
imparted by temperature, some electrons will suddenly leave an orbit and 
jump to another. The higher the temperature, the larger this irregularity 
becomes. Thus a resistor, doing nothing but lying on a bench actually has a 
noise voltage at its terminals: 


Vnoise(rms) = V4-k-T-R-B 
where k = Boltzmann constant (1.38e-23 Joules/T) 


Thus a 1MOhm resistor always has a noise voltage of 13uVrms at 
room temperature, if measured over a bandwidth of 10kHz. This is called 
the Johnson Noise. 

Whenever you mention a noise voltage or current, you also have to 
state the bandwidth. To avoid this, noise voltage is often expressed in 
nV/rtHz (nanovolts per root-Hertz). To get the real noise voltage you 
simply multiply this value with the square-root of the bandwidth. 
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These two noise sources are fundamental, present in any current or 
resistor. But there is another one, not fundamental exactly, but always 
present. It is called 1/f noise or flicker noise. 

Flicker noise is worst in an MOS transistor, which is a major reason 
why bipolar transistors are preferred in analog design. The silicon-oxide 
interface is capable of holding some electrons for a considerable period 
(seconds) and then releasing them in bunches. This increases noise at low 
frequencies far above the white-noise level; at 1Hz the noise level (in 
nV/rtHz) can be two orders of magnitude higher than at 1MHz. 

Flicker noise is also present in bipolar devices, but to a lesser extent. 


Fourier Analysis, Distortion 


Jean Fourier was a mathematician who was active in the French 
revolution. He was arrested twice in the fight between the various factions 
but was spared the guillotine. In 1798 he joined Napoleon's army in the 
invasion of Egypt and then was appointed prefect in Grenoble; in 1809 
Napoleon made him a baron. 

In between his political and administrative duties he found time to 
not only publish a massive work on ancient Egypt but do mathematical 
research. He analyzed the flow of heat in mathematical terms, coming up 
with a novel expansion of functions as trigonometrical series. His memoir 
"On the Propagation of Heat in Solid Bodies" was read to the Paris Institute 
in 1807. His method, now called the Fourier series, was criticized by the 
leading French mathematicians and was not published until 1822 (and not 
translated into English until 54 years later). It turned out to have 

applications in a wide range of areas, 
40 including now electronics. 

When a sine-wave is distorted 
other, higher frequencies are created 
which can be extracted in a Fourier series. 
20 There is an algorithm called "Fast 
Fourier Transform", or FFT, which does 
this. FFT uses an algorithm which allows 

ae ae ED Ee Re fewer computations compared to the 
ig: Be sD Sane sinewave: original discrete Fourier transform. 

In our example here the positive half of a 5kHz sine-wave has been 
compressed. Converted into a frequency spectrum with the help of a 
Fourier transform we see the fundamental frequency of 5kHz and a series of 


Dt-anode / mV 
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harmonics, multiples of the 
fundamental at 10kHz (second 
harmonic), 15kHz (third 
harmonic), 20kHz (fourth 
harmonic), 25kHz (fifth harmonic) 
etc., with gradually decreasing 
amplitudes. The square-root of the 
sum of the squares of all harmonics 
divided by the amplitude of the 
fundamental is the amount of 
distortion. You can usually ne : 
disregard harmonics after the ii ia pee 
fourth or fifth, since their Fig. 6-3: Fast Fourier transform with low 
amplitudes become very small: reeoiHtor 


Amplitude / V 


Harmonics = V36mV? +21mV? +82mV? +09mV? = 425mV 


Fundamental = 550mV 


Distortion = a = 0.077 = 7.7% 
550 


The peak at zero frequency shows the DC level, i.e. the asymmetry 
caused by the clipping. 

Before you run a fast Fourier transform in Spice you need to choose 
two settings: how many samples should be taken over one period of the 
waveform and how many periods should be analyzed. In the example of 
Figure 6-3 there are in fact too few samples and periods, resulting in broad 
peaks. 

As a general rule start with 25 samples and 50 periods. The first is 
determined by "maximum time-step" and "maximum print step" (set at 
8usec in figure 6-4 for a 5kHz driving frequency) and the second by the 
total time in the transient analysis (1msec for 50 periods at 5kHz). You get 
the best results if both the number of samples per period and the number of 
periods are integers. 

The fast Fourier transform has some flaws and limitations. For 
example, figure 6-4 shows peaks in between the harmonics, which, in 
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Amplitude / V 


Amplitude / V 


0 5 10 16 20 25 
Frequency/kHertz 5kHertz/div 
Frequency/kHertz 5kHertz/div 
Fig. 6-4: Fast Fourier transform showing Fig. 6-5: Continuous Fourier transform 
false peaks because of still insufficient with high resolution. 


resolution. 
reality are not there. A superior method is the Continuous Fourier 
Transform, available in some analysis programs and shown in figure 6-5. 
When you have a waveform which is symmetrical but not a sine- 


ey 
3 
=) 
= 


Amplitude / V 
Amplitude / V 


o © 
b& 
So 
Se 
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-0.6 “| 
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Time/mSecs 200ySecs/div Frequency/kHertz 1kHertz/div 
Fig. 6-6: Triangle wave. Fig. 6-7: Fourier analysis of a triangle. 


wave, you get only odd harmonics (i.e. the third, fifth, seventh etc.). Shown 
here is the example of a triangular wave. Total distortion (i.e. deviation 
from a sine-wave) is 12%. 

When you have non-linearity in a circuit and two frequencies are 
present, you also get intermodulation distortion, i.e. not only harmonics 
are created, but differences as well. 
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Frequency Compensation 


Feedback is a wonderful thing. We take the inverted output signal, 
subtract the input signal from it and the amplifier will automatically correct 
and difference between them. If we only feed back a fraction of the output 
signal, the amplifier will automatically adjust its gain to one over that 
fraction. 

Using a single frequency (any frequency), inverting a signal 
(negative feedback) is the same as a 180 degree phase-shift. And here 
comes the problem: Each device in the amplifier has a little bit of delay. At 
low frequency this has little effect, but as we go higher and higher in 
frequency the delay becomes more and more noticeable. At some high 
frequency the delay amounts to half a period of the signal and thus causes a 
phase-shift of 180 degrees. What started out as negative feedback now 
becomes positive feedback and the whole thing oscillates. 

Frequency compensation is a design method which avoids this. The 
principle is very simple: deliberately slow down one device so that it is 
much slower than all others, i.e. it dominates the frequency response so that 
the delay in all other devices is no longer important. 


[- — — Feedback — — This is illustrated with a very 
| 1 | simple simulation. El] is a "voltage- 
In }-p—An—e out controlled voltage source" and acts like 
: a he y an ideal op-amp with a gain of 1 million 
is “Tiga: |) 89 (120dB), has no delay and the input and 
output terminals are free-floating (but 


are referenced here to ground). R1 and 


Fig. 6-8: Abstract circuit to illustrate 
phase-shift in a feedback amplifier. 


Cl cause the single delay (i.e. 
phase-shift). 160 
Due to the RC network = 
the amplitude at the output starts 
decreasing at about 100Hz. At 
this point the phase of the signal 
at the output is considerably less 
than 180 degrees, but as we go 
higher in frequency the phase ‘ ES eee ee 
never goes below 90 degrees. 
Thus the signal being fed back to 
the input cannot reach a phase-shift 
of zero degrees, the condition for 


Phase / deg 
Gain / dB 


Frequency / Hertz 


Fig. 6-9: Single-pole response. The 
phase never goes below 90 degrees. 
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oscillation. 

The point at which the phase has turned by 45 degrees is called a 
pole. At frequencies somewhat higher than the pole the amplitude drops by 
6dB per octave (doubling of frequency) or 20dB per decade. 

Now let's look at the same 


simulation with another RC v2 
network added at the output, with a “ 
pole at a much higher frequency (C i 
= 100pF). We now have two 120 


poles; you can just barely see the 
second pole (at about 10kHz) by 
the change in the steepness of the 


Phase / deg 
0 
Ss 
Gain / dB 


gain curve. The maximum phase- 20 

shift now is 180 degrees. The 0 

point of interest is the frequency at ss cc a a 
which the gain moves through zero pee 

dB (i.e. a gain of 1). It the gain is Fig. 6-10: Two poles in a feedback path 
less than 1 an oscillation cannot approach zero degrees phase-shift. 


sustain itself. While the phase at 
this point only approaches zero degrees, the margin is far too close for 


comfort. 
v2 ‘1 With three poles we are 
ie I Phas clearly out of luck. The phase 
496 id now reaches zero degrees a 
ies eo aain decade before the gain drops 
S a 2 Oselllatioh below 0 dB. An amplifier which 
pa has these three poles will 
én ‘i oscillate, in fact we can tell with 
20 2 certainty that it will oscillate at 
0 200kHz. 
=20 “20740100 1k 10k 100k 1M 10M 


There are now three 
remedies: 1. we can lower the 
Fig. 6-11: Three poles in a feedback path. gain until it drops below 0dB 
Phase-shift goes through zero degrees and before the phase reaches zero 
oscillation takes place. ; : 
degrees; 2. we can insert a new 
pole at a frequency so low that it 
dominates the others and 3. you can introduce a zero. 

To illustrate the effect of a zero, we use another artificial circuit. 
R1/C1, R2/C2 and R3/C3 provide the three poles, delaying the phase of the 
signal, each by the same amount as in figure 6-11. R4, together with C2 
provides the zero, it advances the phase rather than retarding it. The 


Frequency / Hertz 
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frequency (i.e. the value of R4) 


Pole 8 ge Pole Sm POI is selected to result in a 

| eo - a |e frequency where it is most 
ae ees . effective. 

= = |g = At about 30kHz R4/C2 

Fig. 6-12: Three poles and a zero. start turning back the phase, so 
that at the critical frequency 

(SMHz) the gain drops 
below OdB but the phase is 2 z 
still positive, about 15 a veo ase 
degrees, called the phase om , 


margin. Theoretically a 
feedback circuit with this Gan 
behavior will not oscillate, 
though the phase margin is 


Phase / deg 
ao 
oO 

Gain / dB 
ao 


60 6 
rather low. Since gain and rr ‘ 
time constants are subject 7 7 
to variation in an IC, it 0 
should be at least 60 -20 20 pb ah at ee 
degrees. Sienitun 
Now let's look at a 
real design, a simple, bipolar Fig. 6-13: A zero retards the phase-shift. 
op-amp; this rather outdated 
: 5 .y circuit was chosen because it uses 
7 the slow lateral PNP transistors, 
ae which aggravate the phase-shift 
] ° 20p problem beautifully. 
ag The circuit uses the 
Hy—Ker a2 4Llin- 05} classical 3-stage design for op- 
din lg } 7°" amps (more of this in chapter 8): 
Ct) jem oe an input stage which converts the 
2 ‘Kos differential input signal to a single- 
ole SUB |e ended one and has gain, a second 
7 . Y stage (Q4) which provides more 
\|—4 ne gain, and an output stage which 
. ' has no (voltage) gain but provides 


a a reasonably high output current. 
Since high-current PNP transistors 
Fig. 6-14: Measuring gain and phase in a pee: available in an IC, 
feedback loop. the lower portion of the output 
stage uses a compound transistor; 
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from the second stage it looks like a PNP transistor, from the output like an 
NPN one (but the combined device is achingly slow). 

Q5 and Q7 are diode-connected transistors to bias Q6 and Q8. 

The amplifier is investigated as a buffer, i.e. with a gain of one, 
produced by connecting the output directly to the inverting input. Here, 
though, there is an inductor in the path, which blocks AC but lets DC 
through so that the circuit is properly biased. C2, a very large capacitor, 
couples an AC signal to the negative input. In this way the feedback loop is 
opened up and we can measure loop gain and phase. This can be done at 
any convenient point in the loop, but the output to input connection is 
clearly the most convenient. Note that L1 and C2 have impractically large 
values. This is of no great consequence since these components are not 
going to be part on the design; we want to make sure they don't influence 
the AC behavior of the circuit. 

We feed the AC signal into the loop after the inductor and then 
measure the loop response before the inductor (at "Out"). 

First let's look at the loop without C1. The loop gain is about 92dB 
and the phase drops rather sharply, reaching zero degrees long before the 
gain reaches OdB. (Gain 
and phase have identical ue ut 
scales for easier reading). 
In fact, when the phase 
reaches zero degrees, the 
gain is still about 42dB. 
Therefore this circuit is 
unstable, it will oscillate. 

Cl, the 40 
compensation capacitor, 20 
has been placed at the most 0 
strategic point in the circuit. -20 100 7 —— Hi 
There is considerable 


Phase / deg 
fos 
oO 
dbV @ Out / dB 


Frequency / Hertz 


voltage pain between Ale Fig. 6-15: Loop gain and phase of figure 6-14 without 
base of Q4 and the C1. The circuit would undoubtedly oscillate, the phase 
output, which multiplies reaches zero degrees while there is still gain. 


its apparent value (the 

Miller effect). Without this multiplication we would need a capacitor of 
about 2000pF, too large for an IC. It is also important that the capacitor 
feed back the AC signal from a reasonably low impedance (here the output) 
to a very high one (the current mirror and the base of Q4) so that we get 
nearly the full AC voltage swing at this point. 
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The result is self-evident. A new pole is created, about 100 times lower in 
frequency than the next higher one. This pole now dominates up to at least 
10MHz and the phase is still 65 degrees away from zero when the gain 
drops below one. A stable circuit with an adequate safety margin. 

Of course there is a 


- 4 price to be paid for this 
stability: the gain of the op- 
160 16 ; amp may be more than 
asq & 
140 14 90dB at 10Hz, but it drops 
Phase Margin : : 
120 12 65 degrbes steadily as the operating 
g m0) . 10 frequency is increased. If 
# |, 8 ak we use this op-amp at 
= 60 6 10kHz, we only have about 
40 4 58dB of gain. 
20 2 This analysis has 
ao assumed that the op-amp is 
“20 jo 100 ‘k 10k 100k iM 10M 100M going to be used with a 
Frequency / Hertz gain of one. But if you are 
Fig. 6-16: With C1 the circuit of figure 6-14hasa  “Feating a design with a 
phase margin of 65 degrees, i.e. the gain drops fixed gain, say 40dB, there 
through OaB safely before the phase reaches 0. is no reason why it should 


have to be stable at a gain 
of one. Which makes frequency compensation much less demanding. Just 
look at figure 6-15. Subtract 40dB from the gain curve (only the excess 
gain counts) and the amplifier is almost stable, i.e. a much smaller 
compensation capacitor is required. 

The gain/phase analysis, as elegant and informative as it is, has a 
serious flaw: it shows performance only at one particular operating point (it 
is, after all, an AC analysis which does not disturb DC operating voltage 
and currents). A real-life signal will change the DC operating point and the 
loop gain and phase can change substantially. 

Some simulators let you perform this AC analysis at different DC 
operating points, but there is an easier way , one that is a surefire test for 
stability. Get rid of the inductor and C2, close the feedback loop as 
intended in the application and apply a square-wave at the input. The 
square-wave should have fast edges (the default values in the simulator are 
adequate). 

Then observe the output and watch for overshoot. For this circuit, 
with Cl at 20pF, there is a slight overshoot, one peak only. This circuit is 
very stable. (You can also see that the large compensation capacitor affects 
the slew-rate rather badly). 
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With the compensation capacitor reduced to 5pF there are three to 
four peaks, a damped oscillation. Up to four peaks are acceptable. If there 
are more, you are asking for trouble. 

To make absolutely 
sure, do this with a brief (10 
run) Monte Carlo Analysis at 
the temperature extremes and 
also for a rapidly varying 
load and supply voltage (less 
likely to cause instability, but 
it doesn't take much time to 
check). If there are never 
more than four peaks, you are 
safe. 


Output Voltage / mV 


A final small hint: in Time/pSecs 200nSecs/div 


a. gain/p hase analysis Fig. 6-17: To make sure a feedback circuit does 
simulators often get confused not oscillate observe the pulse response. If there 
about the phase. You will is ringing with fewer than 4 peaks, the circuit is 


see a plot which starts not at atenle. 
180 degrees, but at -180. The 
two are in fact the same. 
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7 Bandgap References 


In February of 1964 David Hilbiber of Fairchild Semiconductor 
presented a paper at the Solid State Circuits conference on "A New 


Fig. 7-1: The 
ancestor (1964). 


put in the missing pieces. 
He recognized that the 
difference in diffusion 1 
profiles was only a 
secondary effect and the 
idea would work better if 
the two transistors were 
made by identical 


Semiconductor Voltage Standard". Zener diodes were 
still very poor and he was looking for something that 
drifted less over time. 

It was already known that transistors with base 
and collector connected together made almost ideal 
diodes. Hilbiber took two of Fairchild's discrete 
transistors with greatly different forward voltages (which 
he attributed to different diffusion profiles) and made two 
strings with different numbers of transistors. He found a 
current level at which - over a narrow temperature range 
(+ 2.5°C) - the voltage difference between the two strings 
changed little and amounted to 1.2567V. He attempted to 
find a relationship between this voltage and the bandgap 
potential of silicon at zero Kelvin, but found that it was 
primarily a function of the semiconductor material used 
in the two different transistors. He got what he was after, 
a much better long-term stability, and stopped at that. 

Nothing happened for six years, when Bob Widlar 


VBE + (n deltaVBE) 


n deltaVBE 


processes. 

If you plot the diode ae deltaVBE_ 
voltage (VBE) over eee 
temperature you will notice an ee id 


that it points at the bandgap 
potential at absolute zero. 


Fig. 7-2: The principle of a bandgap reference. 
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This is not strictly a straight line; it is slightly convex below about 150°C 
and concave above (it asymptotically approaches zero volts). The bandgap 
voltage at zero K, by the way, is strictly a theoretical concept; at that 
temperature there are no semiconductors, in fact electrons don't move at all. 

Widlar found that an equal but opposite temperature coefficient can 
be created by running transistors at different current densities: 


k*T Al* 12 
*In 
q A2* I 


deltaVBE = 


where A is the area (effective emitter area) of each transistor and I the 
current running through it. Here you have the choice of either using 
different emitter sizes, different current levels or both at the same time. 

Delta-VBE is a true straight line, pointing to zero at zero K. But it is 
relatively small. kT/q amounts to about 26mV at room temperature, so an 
area (or current) ratio of 10 gives you a delta-VBE of about 60mV. As you 
can see from the diagram in figure 7-2 you need about 600mV at room 
temperature so it counteracts VBE. 


But Widlar came up with a simple Vee 
solution: multiply delta- VBE with a resistor ratio. ig 
R1 creates a current in Q1. Q2 has ten times the 150u 
emitter area of QI, so there is a delta- VBE * +L lvret 
between the two transistors of about 60mV (at ae 
room temperature). This delta-VBE shows up 
across R2. Ignoring a small error due to the base +— Kas 
current, emitter and collector currents of Q2 are Ko2 
equal. Thus the voltage drop across R3 is delta- 10 
VBE multiplied by the ratio of R3/R2. Addingto “Ke! re | 
this voltage the VBE of Q3 we get Vref. a i tise LL 0 
The three transistor form a feedback Fig. 7-3: Widlar’s first 
loop (with limited gain, thus the internal bandgap reference. 


capacitances are sufficient to keep it from 
oscillating), holding Vref at a constant level. If we increase the value of R3, 
Vref increases and the temperature coefficient becomes more positive. If 
we decrease R3, the opposite happens. In this way we can find the right 
value for R3 so that the negative temperature coefficient of the VBE is 
cancelled by the positive one of delta- VBE. 

Widlar's first design was a bit more complicated, using 14 transistors 
and producing 5 Volts with four VBEs in series and the delta- VBE 
multiplied by a factor of about 40. It is no longer used. 
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There is no such thing as an absolutely precise bandgap voltage. 
You will find that the voltage at which Vref has no temperature coefficient 
can be anywhere between about 1.18V and 1.25V due to several effects. 
First, the bandgap voltage is slightly dependent on the doping level. 
Second, the bandgap potential of a semiconductor changes with pressure (or 
stress). And third, we are using (presumably) diffused resistors which have 
a temperature coefficient of their own. Fourth, as pointed out before, VBE 
vs. temperature is not an exact straight line; thus Vref vs. temperature will 
always show a slight upward bow. 

Nevertheless, such a bandgap reference voltage can have an 
accuracy of better than + 3%, without trimming any of the components. 

Apart from base currents (which can be compensated in more 
advanced designs) there are two main sources of error in a bandgap 
reference: 

1) The VBE. This is an absolute, not a ratio. You have to rely on 
the precision with which dopants can be introduced into silicon in the 
process. In a well-controlled process this amounts to about + 10mV 
uncertainty, or about 0.8%. Be aware that prototypes from a single wafer 
(or even a single run) will not give you any indication how much this varies 
in production over many wafers. 

2) Ratios. In Widlar's first bandgap reference there are two ratios of 
significance: Q1/Q2 and R3/R2 (and also R1/R3). To minimize these errors 
you simply make these devices large. 


. Vec Four years after Widlar, Paul Brokaw 
ke published a paper entitled "A simple Three- 
Terminal IC Bandgap Reference". The core of 
07 te the "Brokaw Cell" is formed by Q1, Q2, Q3, 
as, = Se , : Vvret Q4, R1 and R2. (His actual circuit contained 
as ” a 14 transistors, so it wasn't so simple after all). 
oo 15K The Brokaw cell needs a start-up 
as SH circuit, which has been added here (Q7 lifts 
aoe Vref to one VBE, which is sufficient for Q1 
_Tesus & and Q2? to start drawing current). 


Q2 has 10 times as many emitters as 
Q1 (an arbitrary choice, more would be better), 
so there is a delta- VBE of 60mV (at room temperature) across RI. Q3 isa 
current mirror, forcing Q1 and Q2 to run at the same current. Q4 completes 
the feedback loop from the collector of Q1 back to the input bias of the 
differential pair Q1/Q2 and supplies a moderate amount of output current. 


Fig. 7-4: The Brokaw cell. 
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When the circuit is in balance a multiplied delta- VBE shows up 
across R2. Thus Vref is that voltage plus the VBE of QI. The value of R2 
is selected to achieve a zero temperature coefficient for Vref. 

The gain in the feedback loop is limited, which eliminates the need 
for extra frequency compensation capacitors but results in a relatively high 
output impedance (about 80 Ohms). Because of the emitter-follower output 
transistor (Q4), the minimum supply voltage is 2.2V (0 to 100°C), or about 
1 Volts above Vref. 


Figure 7-5 shows a T 7 
modification of the Brokaw cell K * 
for operation at low supply 30k Kas 
voltage. Output current is now 
supplied by Q6, a somewhat Kor +—Koe Viet 
larger than normal lateral PNP ee il I 
transistor, capable of delivering —* 0 fl > . K a8 ae 
5mA. Q4 forms an additional {Kos s et icc 
gain stage, lowering the output —+4 a ae ey 
impedance to 9 Ohms. Note that {os ine 14.8k 2 Rtext 
the operating current for Q4 is beisia 


carefully set by Q5 and R3, with = = = 
R3 having the same value as R2. Fig. 7-5: Three terminal Brokaw reference 
In this way the base currents of with start-up. 
Q3 and Q4 cancel (and, in 
addition, Q1 and Q2 have identical collector voltages). The minimum 
supply voltage is now 1.6 Volts. 

The design procedure for such a bandgap reference is very simple. 
First you set the emitter ratio of Q2/Q1. The two devices should have 
identical emitters for best matching, Q2 just has more of them. Make the 
ratio as high as you can; with a ratio of 2:1 you get a delta- VBE of only 
about 18mV, which puts a strain on the matching. At 10:1 the delta-VBE is 
about 60mV (again: at room temperature) and the matching requirements is 
less severe. At 50:1 the delta- VBE amounts to about 100mV at which point 
matching becomes easy (you also have a large number of emitters which, 
statistically improves matching). 

With the emitter ratio chosen, you now know the value of delta- 
VBE appearing across R1. You then set R2 so it drops about 600mV; in 
this particular case twice the current flows through R2 as flows through R1, 
so a 5:1 resistor ratio will give you 10:1 voltage ratio. 

Next comes the simulation, and for this you need good models, 
including the temperature coefficient of the resistors. Plotting Vref against 
temperature, you will almost certainly see a marked temperature coefficient. 
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Now simply change the value of R2 until this temperature coefficient is 
zero, end to end. A higher value for R2 will give you a more positive 
tempco. 

Ideally, R1 and R2 should have a ratio so that you can divide them 
into identical sections in the layout. In this example 3k/15k would be 
perfect, breaking the divider into six identical sections of 3kOhms each. In 
reality this rarely happens. You may find that, by changing the value of R1 
(thus drawing more or less current) you can get to this ideal ratio, but if you 
don't there is a compromise: Use a smaller basic section (say 750 Ohms) 
and then get the odd value of R2 by making the last section (or perhaps the 
last few sections) a combination of parallel and series connections of the 
basic resistor element. 

Vref shows the 


es characteristic bow of a bandgap 
1.2348 reference, due to the slight 
1.234: . 
5128 curvature of VBE. This amounts 
ee to about 0.18%. 
ane This curve was obtained 
1.23 using models for a simple 5-Volt 
1.232: . 
1.232 bipolar process. The results are 


0 20 40 60 80 


going to be different for other 
processes, you will need to find 
the optimum value for R2 using 


Temperature/Centigrade 20Centigrade/div 


Fig. 7-6: Characteristic bow in the 


temperature curve of a bandgap reference. your own models. Also, the 
final value for Vref will most 


likely be somewhat different too. 
When you plot Vref vs. 
temperature from a simulation you 
get a false sense of precision. You 
will see the curve of figure 7-6 
only once in a while on a real IC, 
one that happens to have the exact 
nominal parameters. What you 
have to live with is more like 
figure 7-7, obtained from a Monte 
Carlo run. Over a range from 0 to 
100°C the variation is about + 
2.5%. This can be reduced by Taen ian ia decid 
trimming and the best component 
to trim is R2. As you can see there D7 oe eee 
analysis) you will see a larger deviation 
is a distinct relationship between than merely the curvature. 


Vref /V 
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voltage and temperature coefficient. R2 controls both. 

Even with trimming, there is a limit to accuracy. When a chip is 
attached to a lead-frame in a package, there is always some stress. Stress 
changes the bandgap potential and, unless some unusual precautions are 
taken, Vref can change as much as 0.5% (in either direction) compared to 

the value measured (or trimmed to) on 
1.24 the wafer. This can of course be 
i avoided if the reference can be 
- trimmed in the package. 


1.23: 


1.23 7 


Vref /V 


1.23. 


1.23; 


1.2: -4 


1.22: 


1.22 
0 20 40 60 80 100 120 140 160 180 


dbV @ Vref / dB 


Time/uSecs 20uSecs/div 


Fig. 7-8: Pulse response, indicating 
stability. 


1k 2k 4k 10k 20k 40k 100k 200k 400k 1M 
The use of a PNP transistor 


at the output makes frequency 
compensation of a feedback loop 
difficult. This is true especially for 
a slow lateral one. About the only practical way to compensate this 
reference is to place a large (i.e. external) capacitor at the output. Even so a 
small resistor in series with the capacitor is 

required to create a zero (see chapter 6). 1 
The loop is stable but the power supply V 
rejection at 1O0KHz is a mere -30dB. 


Frequency / Hertz 


Fig. 7-9: Power supply rejection. 


Vec 


a = Vref 


It's Widlar's turn again. Four years 
after Brokaw he came up with a whole series 
of new bandgap reference designs. Figure 
7-10 shows one of them. QI and Q2 have a 
4:1 emitter ratio (just to show some variety) 
and their emitters are connected together. 

So the delta- VBE shows up between their 
bases, i.e. across R2. This amounts to: 


deltaVBE = 1n4*26mV = 36mV Fig. 7-10: Another Widlar 
bandgap reference. 
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at room temperature. Since there is only one current flowing through all 
three resistors (save the base currents of Q1 and Q2) the voltage drop across 


Vref /V 


\ 
100C 


0 10 20 30 40 50 


H/pA 10pA/div 


Fig. 7-11: Minimum operating current. 


Q2) and the second stage (Q4) is 


reduced by making Q6 three times as 


large as Q7 and Q8. 


minimum current of 25uA to operate 


This reference requires a 


properly. Above that level the 


impedance at the output is about 10 am 
Ohms. Frequency compensation is 
easily accomplished by enhancing the 


all of them is 36mV(53k/3k) or 636mV. 
Add the 600mV VBE of the diode- 
connected Q6 to this and you get a 
temperature compensated reference 
voltage of 1.236V. (Again, this value 
and the required values for R1 or R3 
may be somewhat different for other 
processes). 

The multiplying resistor as been 
split into two parts (RI and R3) to 
provide enough headroom for the 
transistors to operate. The operating 
current for the differential stage (Q1, 


Vref /V 
is 
ig 
g 


0.15 Percen{ 


1.2349 20 40 60 80 T00 


Temperature/Centigrade 20Centigrade/div 


Miller capacitance of the slowest device, 
Q4, with a 10pF capacitor. 


Vref /V 


Let's look at the variation again. 


1.29 20 40 60 80 T00 


Temperature/Centigrade 20Centigrade/div 


Fig. 7-13: Again the bow is a minor 
factor in the overall variation. 
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Fig. 7-12: Deviation over 
temperature (the bow.) 
Once R1 (or R3) is optimized for near- 
zero change at the temperature 
extremes, we get the inevitable bow. 
For this reference it amounts to 0.15%. 
When we put this bow in 
context, namely add to it the production 
variations due to the absolute value of 
the VBE and the matching variations of 
the resistors and transistors, we get quite 
a different picture. The 0.15% bow is 
overwhelmed by the + 3% overall 
variation. (The variation, however, can 
be reduced to perhaps + 2.3% by 
choosing a larger emitter ratio). 
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At the same time Widlar 


introduced his new designs he also 


came up with a way to reduce the 


bow, a method which is now called 


second-order temperature 
compensation. 


To illustrate it we use the same 


bandgap reference again, with one 
transistor added. A portion of the 
voltage across the resistor string is 


tapped by the base-emitter diode of 
Q9 with a large-value resistor. The 


tapped voltage has a positive 


1.222) 


1.2221 


Vref/V 


1.2224 


1.222) 


1.2226 20 40 60 80 TOO 


Temperature/Centigrade 20Centigrade/div 


Fig. 7-15: The bow reduced by second- 
order temperature compensation. 


process. The net result (after 
several adjustment cycles) is a 
flatter curve, showing a deviation 
of just 0.04%. 

But let's put this in context 
again. We may have straightened 
the nominal curve, but it is still 
subject to the variations caused by 
VBE and matching. Adding this 
(Figure 7-16) we see little or no 
improvement in overall accuracy. 
For this reason second-order 


Chapter 7: Bandgap References 


= a E Vref 


Qg 


Qi 


» oH 


Q 
Q7 
2 Ko 


Fig. 7-14: Bandgap reference with 
curvature correction. 


temperature coefficient, the base- 
emitter diode a negative one. At about 
40°C Q9 and R5 start feeding a small 
current into Q6, which increases as the 
temperature is increased. This bends 
the right-hand side of the 
characteristic bow upward. R1, R2 
and R4 then need to be adjusted to 
level the curve, a somewhat delicate 


Vref /V 


Temperature/Centigrade 


20Centigrade/div 


Fig. 7-16: The overall variation now 
overwhelms the remaining bow, so this 
approach should only be used with trimming. 


curvature correction only makes sense if a bandgap reference is trimmed in 


Edition February 2005 


7-8 All rights reserved 


Camenzind: Designing Analog Chips Chapter 7: Bandgap References 


a very sophisticated way, reducing production variation to considerably less 
than 1%. 


Vee Here is a more modern bandgap reference 
R3 which is more accurate (without trimming) than the 
previous examples. Just 4 transistors are used, one 
with a dual base/emitter. It is basically a two- 
terminal reference, fed by R3. There are two 

is |* VBEs in series, so the output voltage is twice the 

40.5k ep | _ bandgap potential, 2.45V. With R3 = 25kOhm the 
_— optimum Vcc range is 4.5 to 5.5V. 

- The delta-VBE appears across R2, given by 
ple, Ke" the 24:1 emitter ratio of Q3 to QI (about 83mV at 
mH 


Vref 


a4 room temperature) and is multiplied by R1 to about 
ite 1.2 Volts. The difference here is the placement of 
el R2 in the collector circuit of Q1, thus subtracting 
rather than adding the delta-VBE. One VBE is that 
Fig. 7-17: A different of the lateral (split-collector) PNP transistor Q2, the 
design forabandgap — other the NPN transistor Q1. Lateral PNP 
reference (2.5V). ‘ fe pects 
transistors generally have a narrower variation in 
VBE, but work only over a limited current range. 
The error signal is picked up by a Darlington transistor (Q4, one 
collector region, two base-emitter patterns). 

Variation in production over a temperature range of 0 to 100°C is a 
mere + 1.6%. As always, the values given here are for a specific process, 
with fairly large dimensions (the resistors are 4um wide). You may need to 
adjust R1 for other processes (and certainly for other emitter ratios). 

The circuit is stable with a load capacitance of less than 5OpF or 
greater than 200nF. With a 330nF capacitor at Vref power supply rejection 
is -60dB, increasing further above 10kHz. The output impedance is 25 
Ohms. The circuit is intended as a reference only, but it can sink several 
milliamperes. If more sourcing current is needed, you simply decrease the 
value of R3. 

It is possible to modify this circuit for 1.2 Volts but, as a 
consequence the performance suffers a bit (which is almost always true 
when you move to lower voltages). 

In figure 7-18 only a single diode-connected transistor (Q1) is used. 
A second one mirrors one-third of the current, which is compared with the 
mirrored current of Q4. Here the emitter ratio is 20:3. A second stage (Q5) 
increases the loop gain, lowering the output impedance to about 1.7 Ohms. 
R3 is optimized for operation from 3 to 3.6 Volts, consuming 90uA. 


Qi 
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Production variation from 0 to 
100°C is + 2.2%. 

Frequency compensation is a 
bit tricky. With a load capacitance of 
500pF or less the circuit is stable and 
has a power supply rejection of -80dB 
below 10kHz, -60db at 100kHz and a 
peak of -40dB at 1MHz. 

Figure 7-19 shows the same 
circuit, transformed into a 3-terminal 
reference, or a mini voltage regulator. 
It uses an NPN transistor to supply the 
output current, which delivers a greater 
current than a (lateral) PNP transistor 
and makes frequency compensation an 


24 
R3 R2 
1.1k 5.251 N\ 
aa Kt 


R4 gy 
a8 300 gon 


Fig. 7-19: Three-terminal version of figure 7-18. 
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Vec 


a Vref 


7 
R1 
42.75k Q3 


YY as 
Re as = 
5.25k 
10p 
a4 7 


Fig. 7-18: A similar circuit, but with a 
single VBE (1.25V). 


ve easy job, but only works down 

to 2.2 Volts supply voltage. 
The base current for the 

ver OUtput transistor is supplied by 
an independent current source 
consisting of Q6, Q8, Q9 and 
Q10. This is the self-starting 
current source discussed in 
chapter 5, figure 5-12. The 
last transistor of the bandgap 
reference, Q5, diverts the 
unneeded current from Q6. Q6 
supplies about 100uA. With a 
maximum hFE of Q7 (at high 
current) of 100, the output 
current is limited to about 


10mA (depending on the size of Q7), which prevent burn-out. 
Production variation (3-sigma) over a temperature range from 0 to 
100°C is + 2.4%. The output impedance is 1.5 Ohms and the circuit is 


stable with any load capacitance. 


Low-Voltage Bandgap References 


The principle followed in the bandgap references so far has been 
this: add two circuit elements with equal but opposite temperature 
coefficient, so that the sum of the two has a temperature coefficient of zero. 
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One of the circuit elements is a diode, which has a voltage drop of about 
600mV. The second circuit element is a multiplied delta- VBE, which also 
amounts to about 600mV. Therefore the minimum reference voltage 
possible is about 1.2 Volts. 
This is only true if we add these two voltages. There are other 
approaches which avoid addition. Let's look at two of them. 
Voc Here is the ferment mind of Bob Widlar 
| i again. As early as 1978 he suggested a circuit 
100u which works down to | Volt supply, a single 
battery. 
oe a First off he uses just about the largest 
=r emitter ratio practical, 50:1 between Q2 and Q1. 
at This gives a delta-VBE of about 100mV at room 
1 ]ver temperature. 


50.25k Am The VBE appears at the base of Q1 to 
i ground. Thus the voltage at the entrance of the 
i current source is higher by a fraction of a VBE. 
TOsus Now assume RS to be zero. Thus the voltage at 


— eS Vref is that fraction of a VBE plus the delta- VBE of 
the 50:1 ratio in emitters of Q2 and Q1. If RI is 
dimensioned such that the fraction of the VBE 
amounts to about 100mV, then we have a 
temperature-stable Vref of 200mV. 

R5 provides some compensation for changes in I] and connecting 
R4 to a tap at R2/R3 rather than ground creates a minor amount of second 
order temperature compensation. 

A Vref of 200mV is just about the maximum value you can get from 
this design. Even with a much larger emitter ratio, say 200:1, delta- VBE 
only amounts to 138mV, i.e. Vref would be about 272mV. 


Fig. 7-20: 200mV Vref 
design by Widlar. 


The second approach is considerably more complex but has greater 
flexibility. Two currents are created, one with a positive temperature 
coefficient, the other with a negative one. Summed, they produce a voltage 
drop in a resistor and this voltage drop has a near-zero temperature 
coefficient. 

The first current depends on the 3:1 emitter ratio of Q6 and Q4 and 
the fact that Q4 runs at twice the current compared to Q6. Thus the 
effective emitter ratio is 6:1 and the current is determined by the delta- VBE 
(47mV at room temperature) and R1. The feedback loop has a gain of 3, 
carefully controlled by the 3:1 emitter ratios of Q1/Q3, in this way the loop 
is frequency-compensated by the device capacitances. The loop is self- 
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starting by leakage currents and the collector currents of Q2 and Q5 have a 
positive temperature coefficient. Two identical currents are derived by Q7, 
one feeding the 
output resistor R3, 
the other starting 
the second current 
source. 

The 
second current 
depends on the 
VBE of Q8 and 
the value of R2. 
Again, the loop 


has a limited Fig. 7-21: Bandgap reference with a minimum supply voltage of 
and well- 0.9 Volts. 


controlled gain 

(the emitter ratio of Q10/Q9 and the 2:1 collector ratio at Q11/Q12), but a 
small frequency compensation capacitor is still required. The collector 
currents of Q1I1 and Q12 have a negative temperature coefficient and one 
collector of Q12 feeds the output resistor R3. The sum of the two currents 
flowing through R3 cause a voltage drop of 250mV, with a temperature 
coefficient near zero. 

The two currents can be adjusted independently with the values of 
R1 and R2, allowing fine-tuning of the temperature coefficient. The 
magnitude of the output voltage can be selected with the value of R3 
without affecting the temperature coefficient. 

Note that the currents depend on the resistor values. They will vary 
in production but R3 tracks these variations and the output voltage depends 
only on resistor matching. 

The output impedance is that of R3. Unless the load draws only a 
very small current you will need an output buffer. 

This bandgap reference works down to 0.9 Volts supply and the 
change in output voltage from | to 1.5V Vcc is 0.25%. Power supply 
rejection is -55dB up to 10kHz. To keep this low at higher frequencies you 
will need an external capacitor (10nF) at the output. 

Production variation is + 3.6% from 0 to 100°C, which illustrates 
that the lower the supply voltage the more difficult it is to get high 
performance, even if a more elaborate circuit is used. 
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Chapter 7: Bandgap References 


Let's face it: a bandgap reference is a bipolar concept. It needs a 
diode and the difference between two diodes. And the only diodes good 
enough are diode-connected bipolar transistors (or, in some designs, the 
base-emitter diodes of bipolar transistors). 
Fortunately there are some layers in a CMOS integrated circuit 
which, although not intended for this purpose, can be used to make a 
passable bipolar transistor. The most obvious ones are those used for a p- 
channel transistor, the p-type region (source, drain) forming the emitter, the 
surrounding n-well the base and the substrate the collector. 

Such a device has limitations. First, the collector is permanently 
tied to the lowest supply voltage. No flexibility there at all. Second the 
gain (hFE) is very low, about 7. In a bipolar process we rely on the high 
gain (at least 100) to effectively eliminate the base resistance as a source of 
error. So the CMOS substrate PNP transistor only works if we make it 
large (which we probably want to do anyway to get reasonable accuracy). 

It is also possible to make lateral PNP transistors in CMOS, using 
the p-channel drain/source diffusions as both the emitter and the collector. 
Such devices have a reasonable gain (100 or so) but, unlike the substrate 
devices, they are hardly ever characterized by the foundry, which means 
you can't consider them unless you want to spring for a rather expensive 


= 7 [V+ 
M5 if 4a 
=lu We1 We1 
=1u L=1 L=1uU 
M12 
a 


50k 


Fig. 7-22: CMOS bandgap reference with substrate diodes. 
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evaluation run. 
For this 

reason we will 
consider only a 
CMOS bandgap 
reference using 
substrate PNP 
transistors here. Q2 
has a single (10um 
x 10um) emitter, 
Q1 has 24 of them. 
Q2 is usually in the 
center, surrounded 
by 2 rows and 
columns of 
identical Q1 
devices. Get used 
to it: this pattern is 
very large 
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compared to a 0.12u, 0.18u or even 0.35u CMOS device. 

The delta-VBE appears across R3, with R1 and R2 being equal. The 
error voltage is amplified by M1, M2, M3, M4, M10 and M12. These 
devices need to be as large as indicated. For M1, M2, M3 and M4 the 
prime requirement is matching, which gradually improves as the area 
(channel length times width) is increased. (Keep in mind that we are 
working down at a level of a delta- VBE, which here amounts to about 
82mV). For M10 and M12 the width needs to be substantial to get 
sufficient gain (transconductance) and an increased length helps to reduce 
the influence of power supply variations. To reduce this even more M9 (a 
cascode stage) has been added. 

M7, a narrow and very long transistor starts the circuit by feeding a 
small current into the loop. Once sufficient voltage appears at Vref, M6 and 
R4 take over and supply the operating current, mirrored by M5, M6 and 
M11. 

M12 is a p-channel transistor, which provides a low minimum 
supply voltage (1.5V) but, as we have seen before, make frequency 
compensation difficult. The only practical way to do this is with an external 
capacitor, though placing it at the output also provides for a good power 
supply rejection (-60dB). The output impedance is 0.5Ohms up to about 
ImA. 

With the transistor sizes as shown and the resistors 4um wide you 
can expect a production variation of + 1.8% over a temperature range from 
0 to 100°C. 


A word of caution: A bandgap reference is the ultimate test of 
accuracy for device models. For example, it is very difficult to measure a 
VBE over temperature accurately enough on a wafer so that it will predict 
the behavior of a bandgap reference. With most processes you need to 
make a bandgap reference to verify the models. 
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8 Operational Amplifiers 


Op-amp design is a specialty, peopled by a small group of engineers 
forever dedicated to the quest of finding the universal building block. None 
has ever been found (hence the large number of different op-amps) but still 
they toil. Year after year they come up with small improvements; and each 
new design has one imperative requirement; it must work in any application 
without creating smoke or - heaven forbid - oscillation. 

When designing an op-amp for an ASIC the precise application is 
known, so the circuit does not need to be universal and the task is easier. 
Not exactly a cinch, but nothing compared to what a designer of 
commercial op-amps has to face. 


The majority of op-amps have three stages. The first stage converts 
the differential signal into a single-ended one; the second one provides the 
bulk on the gain and the third one the required output power. There is no 
law that says it has to be this way, it just turned out to be an approach that 
works well. 


Bipolar Op-Amps 


In our first circuit Q1 
and Q2 form the differential 
pair and Q3, a split-collector 
lateral PNP transistor, is 
connected as a current mirror 
or active load. At the 
collectors of Q2 and Q3 we 
have a high impedance, limited 
only by the base current of Q4 
and the Early effects in Q2 and 
Q3. The second stage, Q4, has 
as a load the current sink Q8 
and the base current of Q5, thus 
its gain is also limited only by those two. The output stage is a simple 
emitter follower with a pull-down current sink. 


Fig. 8-1: Simple 3-stage op-amp. 
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The operating currents for all three stages are derived from I1 
through a multiple current mirror. Q9, with two emitters, delivers two times 
I1, an arbitrary choice; more emitters could be used if more current is 
required at the output for negative-going signals. 

Q7 and Q8 are identical, which is no arbitrary choice. By making 
the collector currents of Q4 and Q3 (both halves) equal, Q3 takes as much 
base current from Q1 as Q4 does from Q2. Thus there is no designed-in 
base current error and the offset at the input is zero (for ideal matching). 


Common Mode Range: Describes the minimum to maximum DC level at which the 
two inputs are functional. Desirable is rail-to-rail, i.e. from the negative supply to 
the positive one, but many op-amps only work with the inputs a volt or two above -V 
(or ground for a single supply) to some voltage below +V. 


Common-Mode Rejection: If you connect both inputs together, bias them at a 
functional DC level and superimpose on the bias a small AC voltage, no signal should 
ideally appear at the output. In reality a small signal leaks through and the measure 
describes how much smaller this signal is (in dB) compared to the input. 


The common-mode range in this design has limitations. The two 
inputs need a dc level of at least one VBE (Q1, Q2) plus a saturation 
voltage (Q7) above -V. If they move below this level the input pair gets no 
operating current. Also, the inputs need to be about 200mV below +V, 
otherwise Q1 or Q2 saturate and Q3 or Q4 are without current. At the 
output Q5 can pull the output only to within a VBE of +V. 

There is also another flaw: being bases of bipolar transistors, the 
inputs need a current. With a minimum hFE of 100 and 50uA flowing per 
transistor, this amounts to base current of 0.5uA worst case. With a 
100kOhm input resistance for one input and zero for the other this could 
amount to as much as a 50mV error at the output. 

Frequency compensation is achieved with a single capacitor from 
the output to the high-impedance node at the output of the first stage. It 
could have been placed just from collector 
to the base of Q4, but the simulation shows 
a small advantage for the shown 
configuration. 

Let's first double-check this 
frequency compensation. To do this (as 
/ explained in more detail in chapter 6) we 
. " place a very large inductor in the feedback 
me loop and feed an AC signal into the input 
through a very large capacitor. The 
Fig. 8-2: Simulating loop gain inductor (1MH, i.e. 1 million Henry) 

anc PNase: provides the DC bias to the input but blocks 
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the AC signal. You want to choose the inductor and capacitor large enough 
so they have no effect at even the lowest frequency of interest for the 
circuit. Neither 1 million Henrys nor 1 Farad are practical values; they 
don't need to be. 

And here is what we 


co y2 v1 
get: The open-loop gain is ‘. 
about 85dB. The dominant 160 ‘6 = 
. ase 

pole, given by the 149 4 
compensation capacitor, is at a 12 
about 2kHz, after which the gM 10 

F - o 80 = 8 : 
gain decreases steadily and fl let 
reaches unity (OdB) at about és ’ 
12MHz. At that point the 20 ; 
phase is still at about 50 0 
degrees, marginal but probably = {00 1k 10K —T00R TM TOT -5OM 


adequate. 


But as poled out a Fig. 8-3: For stable operation the gain of a loop 
chapter 6, a phase-margin must reach OdB before the phase reaches 0 
analysis is not the real test of degrees. 


stability. AC analysis uses 

infinitely small signals (even if it says the signal is | Volt) and the operating 
currents and voltages are not disturbed. So, to be certain, we would have to 
repeat this analysis for the DC conditions over the entire (large-signal) 
range of the circuit, a tedious task at best. 


Frequency / Hertz 


C) “ We get a more immediate picture by 
ay 6 observing a large-signal pulse. 
a ee ae To do this we eliminate the 
in -\Y¥ inductor and capacitor in the feedback 
a 1 
0. 
Fig. 8-4: Buffer connection. 0. 
path and connect the signal to the 7 Output 
input. Without resistors in the a Input 
feedback path, this is a buffer 0. 
connection, i.e. a closed loop a 
. . . -0. 
gain of one; with the entire open- os 
loop ean (85dB) being judged, % 100 200 300 400 500 600 700 800 900 
Betti most Seve test for Time/nSecs 100nSecs/div 
stability. 
The output waveform Fig. 8-5: Input and output waveforms for the 
shows the amplifier to be very BUNGE COnMScuON. 
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stable, there is no ringing, just a small overshoot. 

But the curve shows something else: rise and fall-times are 
substantial and almost straight lines. This is the slew-rate, the time it takes 
to charge and discharge the 10pF capacitor over a 2-Volt span with the 
operating current. You can speed it up by increasing the operating current, 
at the cost of power consumption. 

In doing this test we assume that the op-amp needs to be operated as 
a buffer. What if, in a specific application, the closed loop gain is never 
lower than 40dB? In such a case we only have an excess open-loop gain of 
about 45dB, which makes compensation considerably easier. 

Let's examine this. In figure 8-6 we have the same circuit as in the 
loop gain analysis before, except that the feedback resistors are in the loop 
also. 


y2 v1 
ae 
Int ay 5 160 16 
= Fig. 8-1 Out " ‘ - Phase 
-V —- 120 12 
| -V 
n C) B 100g 10 
5 9 so] = 8 
£ 5 
a 
R2 | Ri 1Me % ° 
40 4 
1K 99k | ou Gain 
7 == Gi 20 2 
0 


Vac Frequency / Hertz 


Fig. 8-6: Simulation of loop gain and Fig. 8-7: With the lower gain the loop shows 
phase with closed-loop gain of 100. greater phase margin, even though the 
compensation capacitor (C1 in figure 8-1) is 
reduced to 2pF. 


The gain now shows a 45dB 
maximum instead of 85dB and thus 
drops to OdB at a lower frequency. We 
can now reduce the value of the 
compensation capacitor from 10pF to 
7 2pF and still have a phase margin of 
over 60 degrees. 

When we check the stability 
with a pulse (with only the resistors in 
the feedback loop and the signal 
Fig. 8-8: Pulse response with 40dB_ = connected to the positive input) we see 
oop :daln shore ee ald that the amplifier is just stable enough 

om (fewer than 4 peaks in the damped 


Output Voltage / V 


-t 


0 02 #04 06 08 1 1.2 414 16 18 


Time/uSecs 200nSecs/div 
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oscillation). And, because the compensation capacitor is smaller, the slew- 
rate is substantially higher. 

No consideration has been given so far to noise and noise 
performance is in fact not that great in this design. The majority of the 
noise is created in the input stage; in subsequent stages the signal is larger 
and the influence of noise is 
correspondingly reduced. To 
lower the noise in QI and Q2 the 
devices need to be larger and 
their operating current higher. 

Figure 8-9 shows the 
noise performance of this 
amplifier in the buffer 


200nq 


100 


It= 10uA 


Output Noise / V/rtHz 


11 = 100uA 


configuration (since the gain is 1, 20n 
output noise and input noise are 
equal). As you can see, lowering {902000 Tk BRB AK To 
the operating current increases Fig. 8-9: Noise of the amplifier in the buffer 
noise, an unpleasant fact of life. connection. 
t 7 ne Naturally you can invert 
Pai Kos Kas the polarity of the transistors and 


design an op-amp whose input 
can operate close to the negative 


mm {Kor oe | am 4—mmour Supply (but loses its ability 
. To within about 1 Volt of the 
\Q" - positive supply). Figure 8-10 is 
50u + ' 4 Kas 


the PNP-input equivalent of 
figure 8-1, still with a rather 
| | i 1==y. primitive output stage (Q6, Q9), 
which can pull the output to 
Fig. 8-10: PNP-input equivalent of figure 8-1. within about 150mV of the 
positive supply (if the load does 
not require more than 50uA), but only down to about | Volt above the 
negative rail. 


Let's now consider a design in which the inputs can be operated all 
the way down to the level of the negative rail and the output swings 
(almost) rail-to-rail. The input stage in figure 8-11 is a configuration known 
as the folded cascode stage. With an operating current (I1) of 10uA the 
voltage drop across R1 and R2 is a mere 50mV, thus the inputs can go 
about 250mV below the negative supply rail without saturating Q1 or Q2. 
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The collector currents of QI and Q2 upset the balance of the Wilson current 
mirror Q3, Q4 and Q5 and the difference signal is picked up by Q6. 


The output a C. Cc 
stage has two Z K Harz ato ons 
branches to it. The " Tk 
first one is simply + FKt 
Q14, a grounded ke ye 
emitter amplifier. a 2 ie — r| jm out 
All other transistors om—_* am ko , = 
in this block serve NX 
its antipode, Q13. io" ole Ke 

Note the me | | a 
three diode Ri Re a a 
connected ; 5 sup _| ty 


transistors Q7, Q8 Fig. 8-11: Op-amp with folded-cascode input stage and 
and Q9. They set a (almost) rail-to-rail output. 


voltage for the base 
of Q11. If you follow the emitter-base junctions of Q11, Q12 and Q14, you 
notice there are also three diodes in series to the V- rail. Thus, as the input 
signal to the output stage moves up and down (by a few millivolts), the 
current in Q1I1 fluctuates. It is this current, amplified by the size ratio of 
Q13 to Q10 (here about 6) that becomes the pull-up portion at the output. 
Q7 to Q9 are deliberately made larger than Q11, Q12 and Q14 so that the 
idle current in the output is small. This creates a small "dead-band" (see 
chapter 16) but, because of the large loop gain the distortion is very small 
(0.0004% for a + 4.7Vp signal at 1kHz). 

In this circuit we are 


oe fortunate to find a node ideally 

ae Pe suited for the connection of a 

140 4 compensation capacitor to the 
- ; hase output: at the base of Q6 the signal 
= ph 36 has a phase opposite to that at the 
F 6% 6 il output; the base of Q6 and the 

a ‘ collectors of Q4 and Q16 all 

; represent a high impedance; and 

-20 at t+ thts), there is substantial voltage gain 

cacustaeleke between it and the output. Which 
Fig. 8-12: Phase margin of figure 8-11. all says that this op-amp can be 


compensated (at unity gain) with a 
single 5pF capacitor, even though 
the loop gain is 110dB. Not all designs behave that well. 
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Though the inputs can 
work at the level of V- (or 
ground if you have only a 
single supply), the output 
cannot. It is the sad truth for a 
bipolar transistor that there is a 
saturation voltage. Unlike an 
MOS transistor, which is 
simply a voltage-controlled 
resistor, the bipolar transistor is 
the interaction of two junctions 
with different doping levels 
and sizes. Even when fully Figure 8-13: Output swing is limited by the 

: ene saturation voltages of Q13 and Q14. 
turned on there is a minimum 
voltage drop of about 150mV 
between emitter and collector in transistors Q13 and Q14. Thus the output 
can never be at the rails, only approach them. 

The use of lateral PNP transistors at the input and the low operating 
current is not kind to noise: 27nV/rtHz at 10kHz and up (white noise). At 
1Hz the flicker noise rises to 80nV/rtHz. If you have vertical PNP 
transistors at your disposal and can afford a higher operating current, these 
figures drop by a large factor (but you need to carefully re-simulate the 
entire circuit; frequency behavior is bound to be entirely different). 

Another unsatisfactory parameter is the input current. Each input 
transistor runs at SuA; with a minimum hFE of 100 (in a good process) the 
base current can be as high as 50nA. But there is a solution to this: more 
transistors. 


Out/V 


0 0.2 0.4 0.6 0.8 1 


Time/mSecs 200uSecs/div 


Fig. 8-14: Op-amp of figure 8-11 with base-current compensation for the input stage. 
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In figure 8-14 eight transistors are added. Their job is to pull as 
much current out of the bases of Q1 and Q2 as they naturally require, so 
that he external circuit does not have to do this. 

The key is Q22. It is identical in size and design to Q1 and Q2, has 
the same operating current and very close to the same collector-base voltage 
(created by the base-emitter voltage of Q23 and the diode-connected 
transistor Q24). Thus its base current must be the same as those of the input 
transistor. This base current is mirrored by Q21, Q20 and Q19 and fed to 
the inputs, opposing the two base currents. 

The cancellation of the base currents is never perfect of course, the 
current levels are too small to get precision matching. But the net input 
currents are down to 2nA, a 25:1 improvement. 


In the last bipolar op-amp the goal is not an ultra-low input current, 
but low-noise performance with a reasonably low input current. 


7 Hee, : 
is" ee : 


10 5p 
In | Qt Q2 In+ Out 
10 10 
Q16 
Q8 Q14 Qs 
Q7 
SUB ao a a 


Fig. 8-15: Bipolar op-amp optimized for low noise. 


This circuit is almost identical to that of figure 8-1. The input 
transistors are now again NPN, but with the base-current canceling scheme 
added. The key transistor is Q13; since the hFE of an NPN transistor 
changes much less with current than that of a PNP device, we can afford to 
run it at twice the current and then divide the base current by two in the 
current mirrors Q12/Q11. This brings the input current down to 20nA. 

The operating current is much higher than that of the previous 
circuit and the input transistors are large, which lowers the white noise to 
5nV/rtHz. 
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CMOS Op-Amps 


CMOS devices have two advantages for op-amps over bipolar ones: 
there is no input current (at least not at DC) and, when the transistor is fully 
turned on, there is only a simple resistance between drain and source (not 
some complex cancellation of two junctions, resulting in an offset voltage 
and a resistance). , t , ——_— 

Let's again ii it if 
first look at a simple wesoo] | (wane) | foams 
design. As in figure 


L-5u L-5u L-5u 


8-11 a "folded mt | [me] 
cascode" input stage | ' s 
is used, this time 7 08 
using N-channel _ i ” 


transistors (M5, 
M6). The primary 
current, I1, is 
mirrored with M1- 
M4 and then again 
in M7-M10, using 
the circuit of figure 


3-24, so the * cee ; . oo 
operating current Fig. 8-16: Op-amp with folded-cascode input stage and a 
for the input pair is simple (and limited) output stage. 


a constant 20uA. 

The four transistors M1-M4 also steer M11/M12 and M13/14, 
producing two more accurate currents of 20uA each. The drains of the 
input transistors are connected to the sources of M12 and M14, which have 
a potential about 200mV below V+; thus the inputs can operate up to (and 
about 100mV above) the positive supply. 

At balance (In+ = In-) the input pair diverts half of the 20uA current 
produced in M11 and M13, i.e. M12 and M14 are left with only half the 
current, about 10uA each. The current out of M12 is mirrored in M15/M16 
and opposed to that flowing out of M14. With a large input signal the two 
currents become unbalanced and each can vary between zero to 20uA. The 
voltage created by this unbalance is amplified by the output stage (M18 and 
a simple pull-up current source, M17). The idle current of the output stage 
is set by the ratio of the channel widths of M1 to M17, 1.e. 60uA. 
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The circuit is ye v1 
compensated with a single 
1pF capacitor, utilizing the “ 8 
Miller effect of M18. This 
works well as long as the 
load capacitance is small. 
At 10pF the stability is 
marginal when connected : 10pF [Load 
as a buffer; if the closed- es 
loop gain is never lower 2 
than 10 however, the op- 100 
amp is very stable, even 


with a load capacitance as 
hich as 500B P Fig. 8-17: Phase margin becomes critical with a 
1gN as 2 . capacitive load, unless the closed-loop gain is 
With a load resistance 40dB or larger. 


of greater than 25kOhms 


100 6 


50 


Phase / deg 
Gain / dB 


10 100 1k 10k 100k 1M 10M 100M 


Frequency / Hertz 


(60uA) the output can move rail- 
to-rail (or, more precisely, to 
within about 100mV of each rail). 
All CMOS examples in 
this chapter assume a split power 
supply of 3 Volts total, or +1.5V; 
they are operational down to 
+0.8V, though with reduced 
performance. 
You need to be aware of 
the changing open-loop gain. At 
Fig. 8-18: Output swing. ground level it amounts to 100dB. 
As the output moves close to 
either supply there is a marked drop (66dB at -1.4V, 60dB at +1.4V). This 
can be improved by making the output devices larger. 
What cannot be improved is a fundamental dependence of loop gain 
on the load impedance. The lower the load, the lower the loop gain. 
The input stage has a limited common-mode range. It will work up 
to about 100mV above the positive rail, but not below about -0.8V, i.e. 
about 0.7V above the negative rail. 
Let's convert the wimpy output stage into a true rail-to-rail one, with 
some current capability: 


1 


Output Voltage / V 


1.59 02 0.4 0.6 0.8 1 


Time/mSecs 200pSecs/div 
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Fig. 8-18: Op-amp with a more capable rail-to-rail output stage. 


The trick in designing a rail-to-rail output is in the biasing of the two output 
transistors. You want a small but well-controlled idle current to minimize 
any uneven behavior as the output signal is switched from one transistor to 
the other. In this circuit there are eight transistors whose only job is to set 
this idle current. 

Follow M25 and M26, two "diode-connected" n-channel devices, 
fed by the current source M24. At the gate of M23 we then have a DC 
potential of about 1.2V above the negative rail. There is a second path from 
this node to V-, through M23 and M29, also n-channel transistors. Thus the 
current in M29, one of the output transistors, depends on the current 
supplied by M24 (and derived from I] through M1 and M2) and the channel 
dimensions of M23, M25, M26 and M29. An identical arrangement is 
provided for M27 by M19 to M22. With the dimensions shown the idle 
current amounts to 70uA. 
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In this circuit we also have two higher-performance current mirrors: 
M15 to M18 (see figure 3-24) to get the maximum open-loop gain and M5 
to M8 (see figure 3-25) to get the highest possible common-mode rejection 
(now 98dB, up from 94dB in figure 8-16). 

The output stage is capable of supplying 1mA peak and can get 
within 100mV of the rails with a 5kOhm load. This performance can be 
increased by making M27 and M29 wider. 

Though capable of a much higher current without wasting idle 
power, this rail-to-rail output has the same weaknesses as the previous one: 
it is very sensitive to capacitive load and the open-loop gain is 
fundamentally related to the load 
impedance. With no load it is 
' 105dB; with a 100kOhm load the 
open-loop gain drops to 101dB, 
with 10kOhm to 88dB and 1kOhm 
it reaches a paltry 68dB. This, 
alas, is an unpleasant fact with 
rail-to-rail outputs. 

Larger input transistors are 
also used in this circuit, which 
reduces the white noise to 
23nV/rtHz (28nV/rtHz in the 
previous circuit). 


a 


Output Voltage / V 


-1 


1.59 02 04 0.6 0.8 1 


Time/mSecs 200uSecs/div 


Fig 8-20: Output limits with 5kOhm load. 


In the next circuit (figure 8-21) the polarity of the input stage is 
reversed and the current mirror for the second stage (M11 to M14) is 
designed to have the highest possible output impedance, resulting in an 
increased loop gain. Also note that the primary current has been 
(arbitrarily) reduced to 5uA. 

The lower operating current level has only a minor effect on the 
sizes of most devices; they still need to be large to obtain satisfactory 
matching, much larger than the process (0.35u, or the higher-voltage portion 
of a 0.18u process) would allow. The idle current of the output stage is now 
reduced to 10uA. 

Open-loop gain is 107dB at low frequency and with no load. 
Capacitive loading is still a problem (but much reduced if the minimum 
closed-loop gain is higher than 1) and, as before, the closed loop gain is a 
function of load impedance. 

The input operating range now extends from about +0.8V to 150mV 
below the negative rail. If a single supply is used the inputs can function at 
or below ground level. 


Edition February 2005 8-12 All rights reserved 


Camenzind: Designing Analog Chips Chapter 8: Operational Amplifiers 


Because of the large dimensions used for the input transistors the 
white noise level is a relatively low 21nV/rtHz. Note, however, that they 
are run at twice the level of I1. 
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Fig. 8-21: Op-amp with P-channel input. 


Now let's extend the operating range of the input by using both p- 
channel and n-channel devices. In the circuit of figure 8-22 we are adding 
an n-channel differential pair which takes over when the DC level at the 
inputs reach about +0.8Volts and the p-channel devices get cut off. There 
are three voltage regions for the input now: within 0.8 Volts of the negative 
rail only the p-channel devices are active; from about -0.8 Volts to +0.8V at 
the inputs, both pairs amplify and within 0.8V of the positive rail only the 
N-channel devices amplify. 

When both pairs are active, the open-loop gain is at a maximum, 
reaching 120dB. When the common-mode level is either high or low, the 
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loop gain drops by 10dB. Many schemes have been offered in the literature 
which hold this gain more constant (e.g. by allowing only one pair to 
operate at a time), adding another dozen devices. For most applications the 
benefits of this measure are limited; in fact simultaneous operation of both 
pairs increases performance (noise, for example drops to 20nV/rtHz, 
compared to 30nV/rtHz when only one pair is amplifying). 
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Fig. 8-22: Op-amp with rail-to-rail inputs and output. 


The two input stages work with the second stage (M11 to M18) as 
folded cascodes. The lower part of the second stage (M15 to M18) is a 
current mirror derived from M1 to M4, set here at about 10uA; the upper 
part (M11 to M14) mirrors the current again, so that at M19/M23 the 
currents cancel if there is no input signal. M19 and M23 set the bias current 
of the output transistors (M27, M28) 
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With an input signal one or both input stages change the currents in 
the second stage, resulting in a net positive or negative current through 
M19/M23, which is translated into a larger current at the output. 

A common problem in op-amps using two separate input stages is 
created by the random nature of the offset voltage. Suppose one pair has an 
offset voltage of +5mV the other -SmV. As the signal moves from one 
stage to the other, this causes a jump of 10mV, creating distortion. 


Auto-Zero Op-Amps 


Auto-zero or chopper stabilized amplifiers have been around for 
decades but continue to evolve. In a modern embodiment two amplifiers 
are used, checking up on each other. 

Each amplifier has a "Trim" input, i.e. a single node which changes 
its offset voltage in both directions. 

A built-in oscillator flips the two switches periodically at a rate of a 
few hundred to a few thousand Herz. In position A the inputs of amplifier 2 
are shorted together and its own offset voltage is amplified (with the open- 
loop gain) and corrected by feeding the output to the trim input. The 
required trim voltage is stored in capacitor Ca. 

In the second phase of the oscillator the inputs of amplifier 2 are 
connected in parallel to those of amplifier 1 and its output now feeds the 

el trim input of amplifier 1. With the 

| charge remaining across Ca, amplifier 2 
continues to be nulled and thus corrects 
the offset of amplifier 1. As the 
oscillator switches back to phase A this 
correction voltage remains across 
capacitor Cb. With the high open-loop 
gains of both amplifiers the offset 
voltage is now reduced to microvolts. 
Since the correction is done repeatedly, 
temperature drift is also much reduced. 

There is an additional benefit. 
Anything sensed by amplifier 2 below 
the switching frequency is treated as an 
offset. This includes flicker (1/f) noise, 
which is completely eliminated. Above the switching frequency the 
behavior of the auto-zero amplifier is identical to a regular op-amp. 


Fig. 8-23: Auto-zero op-amp. 
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Apart from the higher current consumption because of the additional 
circuitry there is one drawback: switching noise. At the switching 
frequency there is a noise peak which also causes (intermodulation) 
distortion. This effect can be ameliorated by changing the switching 
frequency at random, i.e. creating a spread spectrum. 


Distortion in an Op-Amp 


An op-amp is basically a non-linear circuit. The input stage, for example, can 
accommodate only a small differential voltage before it is limited by the input devices, 
both bipolar and CMOS. 

Feedback reduces the distortion caused by these limitations. Increasing the 
amount of feedback increases the linearity. 

If you increase the open-loop gain by a factor of 10 (20dB), distortion drops 
by a factor of 10, assuming that, by increasing the gain you have not added more 
distortion. 


The Miller Capacitance 


In 1919 John M. Miller was physicist with the National Bureau of Standards 
when he wrote a paper on how the grid capacitance of a vacuum tube was so much 
larger in use than measured statically. The voltage gain, he said, multiplies the 
capacitance between grid and plate. What he described has been known as the Miller 
effect or the Miller capacitance ever since. 

Miller went on to doing research at Atwater Kent, RCA and the Naval 
Research Laboratory. In 1953 he was awarded the IRE Medal of Honor. 

The exact same effect was found in both the bipolar and MOS transistor. In 
most applications it is detrimental, limiting the frequency response; in IC op-amps, 
however, it has been helpful, greatly decreasing the size of the compensation 
capacitance. 


John M. Miller: "Dependence of the input impedance of a three-electrode vacuum tube upon the 
load in the plate circuit", Scientific Papers of the Bureau of Standards, 1920, pp. 367-385. 
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9 Comparators 


To most people a comparator is merely an op-amp without feedback. 
With the very large open-loop gain the output abruptly traverses the entire 
available voltage range when one input passes the level of the other. 

i roe This is true for the majority 
of comparators, but there are also 
\Q@k* some refinements and variations. 
Let's examine them. 

The first circuit is indeed of 

the common variety: an input 
ye differential pair (Q1, Q2), a current 
mirror active load (Q3) and a second 
= = = : stage (Q4), giving a voltage gain of 

Fig. 9-1: Simple but accurate bipolar about 95dB. ’ 
comparator. The second stage is run at 
half the current compared to the 

input stage, so that it switches when the differential pair is in balance. It 
uses a separate current mirror (Q7, Q8) for a good reason: Q7 saturates. If 
we were to run Q7 off Q5 (as Q6 is), it would grossly decrease the collector 
current of Q6 as it saturates. 

This comparator, using bipolar transistors, requires a small input 
current; with an operating current of 50uA (25uA for each input transistor at 
balance) and a minimum hFE of 100, that amounts to 0.25uA. We could of 
course decrease the operating current, but at the expense of speed and noise. 

Also, the reference voltage (i.e. the common-mode voltage) cannot 
drop below the VBE of the input transistors (plus the saturation voltage of 
Q6), otherwise the input stage is simply cut off. At the upper end the 
common-mode range stops at about 0.2V below Vcc, when the input 
transistors saturate and cut off Q3 and Q4. On the other hand, Vcc can be 
as low as | Volt. 

A simulation for a high-gain circuit like this one is best set up by 
connecting two voltage sources to the inputs. One is steady DC (say 1.5 
Volts) while the other one is swept from ImV below this value to 1mV 
above it. You will see the output change drastically very close to the zero 
difference at the input. There is very little built-in error because Q1, Q2 and 
both sides of Q3 operate at the same collector-base voltage; there is only a 


\er" 
50u 


Q5 
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small second-order error due to the fact that the collector-base voltage of Q4 
is larger. 

But don't let this observed accuracy fool you into believing that this 
is what will happen in production. Move on to a Monte Carlo analysis and 
you will find that the offset voltage of the differential pair moves the 
switching point (by about + I1mV, depending on the process and the size of 
the transistors). 


The bipolar design can te M all 


be directly translated into | | 


CMOS, with a logic stage can 
added at the output. The gain * 
is now 110dB. 

There is no (DC) input 
current, but the limitations 
concerning the common-mode 
range still apply. 

In a CMOS circuit | 
saturating current mirrors need ad 
not be feared; the current in M8 —_ 
can be derived from M6. The 
fact that the drain of M8 can 
end up very close to ground has no adverse effect on M7. A word about 
the transistor dimensions: the logic stage at the output is designed for a 
0.35u process, all other channel lengths and widths need to be this large 
even for a process capable of smaller sizes. M1 through M4 require a large 
area for adequate matching (in fact, offset can be further reduced by 
increasing their sizes) and the Sum channel lengths reduce dependence on 
supply. 

Quite often hysteresis is required in a comparator, i.e. the threshold 
is higher when the input increases and 
lower when it decreases. For 
example, if you have a "low fuel" 
warning light you don't want this light 


Fig. 9-2: CMOS version of figure 9-1. 


Vec 


Q5 50uA Out 


Q6 


Q2 


Q7 


400 


to flicker on and off as the fuel sloshes 
in the tank, so you set the threshold to 
a low level as the fuel is consumed 
and to a higher level as the tank is 


ain filled. In figure 9-3 two features have 
od = been added. Replacing the simple 


Fig. 9-3: Comparator with hysteresis. current mirror, Q3 and Q4 form a flip- 
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flop with precisely controlled gain, giving the circuit a snap action. In 
addition, the diode connection of Q4 makes Q5 into a current mirror (each 
collector sources one-half of the current). This current is fed into a 400 
Ohm resistor, causing the reference 
voltage appearing at the base of Q2 to * vii Ta 
increase by 10mV (the resistor value a. 
can, of course be changed to increase “quot 
or decrease this value). The voltage at \ 
Ref must be capable of sinking the arts \ 
collector current of Q5. re TyrvOn 
As the input voltage decreases [ N 
from some value above the reference ag zn 
voltage, the current out of the output Vi 
terminal abruptly increases from zero 
to about 25uA (the exact level depends Fig. 9-4: Switching levels with 
on the output voltage because of the hysteresis. 
Early effect). You now have to 
increase the input voltage to 10mV above the reference level to turn the 
output current off. 
Figure 9-5 shows the same circuit in CMOS, with the current output 
(M8) opposed by a current sink of half the level (M13), and a logic stage 


{Vcc 
M9 
| 
| 
W=2 
L=0.5 Out 
M10 
=1 
L=0.5 


Fig. 9-5: CMOS comparator with hysteresis. 


added. CMOS has an advantage here in that the custom sizing of the 
transistors allows the amount of positive feedback to be set in precise 
increments (M3-M6). Note that the operating current (Ibias) has been 
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reduced and the value of R1 increased, resulting again in a hysteresis of 
10mV. 

Ibias will most likely be derived from a resistor value (and, perhaps, 
a bandgap reference voltage). R1 will track this resistor value and thus the 
hysteresis is remarkably accurate and stable with temperature. 

A comparator with hysteresis requires some thought before 
simulating or testing. The two different thresholds have to be approached in 
the proper sequence. In a simulation this can be done with a transient 
analysis, i.e. letting the input voltage increase until it exceeds the upper 
threshold, then decreasing it until a level below the lower threshold is 
reached. Similarly, in testing the input is ramped up until switching occurs, 
and then ramped down until the output changes states again. 


In the examples so far NPN or : 
N-channel transistors have been used as " i a ato 
for the input differential pair. This is a N 
disadvantage when input signal and 
reference are near ground level, unless mG a i: 
you have a split power supply. 


Q Q4 
By converting the input to PNP ‘ on 


or p-channel and a couple of design | i r t Kar 
refinements, a comparator can be made “00 as a6 


to work at ground level, even if there 
is only a single supply. - = = == = 

In figure 9-6 a Darlington input Fig. 9-6: A Darlington input allows the 
stage is used not to decrease the input MyEpi eee arQungerer 
current, but to allow the comparator to 
operate even if the input drops slightly below ground. With as much as 
400mV below ground at the input Q1 is still in its active region. At that 
point the base of Q2 is about 200mV above ground (at room temperature), 
which is sufficient to keep Q5 from being cut off. (Strictly speaking the 
input stage is not a pure Darlington connection, since the collectors go to 
ground). 

This circuit has a definite upper temperature limit (about 100°C) and 
is rather slow because there is no discharge path at the bases of Q2 and Q3. 
Since the primary object is not a low input current, however, there is no 
reason why we could not place two additional small current sources (like 
Q9) at these points. 

Though rarely needed in an ASIC, the ideal comparator has a 
rail-to-rail input (it already has a rail-to-rail output). 


TOsuB 
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This is actually quite easy to achieve: two input differential pairs, 
one n-channel, the other p-channel; mirror the currents of one and sum the 
result with the currents of the other. 


a a = = Vec 
M5 | Lv [ve 
P I 
W=20u =20.3u W=20.3u =20U 
L=1u L=1u L=tu L=1u 
M=2 M=2 M=2 M=2 "i 
' a i | M15 
W=20u 
In Mi M2 Ref L=tu 
r| M=2 W=2u Out 
be a w—a L=0.5y J 
W=20u W=20u 
L=5u L=5u M16 
M18 M19 M14 
| eres 
L=0.5 
ce 
W_20u W=20u W-t0U 
L=2u L=2u L=2u 
M3 M4 
| i: ch a : 
w=20 W=20u 
L=5 =5u 
M9 M10 M11 M12 
=20u W=20u =20u W=20U 
L=1u  L=tu L=1u  L=1u 
M=2 M=2 M=2 M=2 


Fig. 9-7: CMOS comparator with rail-to-rail inputs 


In this example the active load of figure 9-5 was chosen, again with 
sufficient positive feedback to give a snap-action (M5-M8). Note that M5- 
M8 and M9-M12 have a considerably large w/] ratio compared to the 
corresponding M1 - M2 and M3 - M4 to allow the input to go slightly 
beyond Vcc and ground. 
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Current Comparators 


T ve When you use the word comparator you 
yy automatically assume that voltages are compared. 


But this does not always have to be the case, 
A as lc sometimes it is useful to compare currents. 
With a simple current mirror (Q1, Q2) 
\em evena small difference in the magnitudes of I1 
ait ie and I2 will show up quite drastically at the base 
of Q3, turning it on or off. 
The base-current error is eliminated if 
Fig. 9-8: Bipolar Current Ibias is set at twice the level of I1 and 12. The 
comparator. only remaining error is due to the Early effect of 
Q3, which is easily reduced by adding another 
NPN stage and using a more sophisticated current mirror in place of Q4. 


The CMOS version is | ie at ‘ical 
almost identical. There is of H e TWF 
course no base-current error, \e me | wae we 
but the comments above ™ | a 
about the Early effect (or A . | i 
channel-shortening) apply. 1 I it ie 
Yet even without any 7 | - wee 
improvements both circuits | | rin |S ee wo 
switch abruptly within ie wat 
0.0006% over a wide Mista 


temperature range. Matching — _— = = 7 
variations, however, are 
another matter and you may 
have to make the input current mirrors quite large to get enough accuracy. 
Find out with a Monte Carlo analysis. 


Fig. 9-9: CMOS version of current comparator. 
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10 Transconductance Amplifiers 


For a while it looked like there would be a second universal building 
block, and the concept was called the Operational Transconductance 
Amplifier. But the OTA has rather severe limitations and there is no danger 
that it might de-throne the op-amp anytime soon. 

Let's examine the concept using a simple bipolar design. Just as in 
an op-amp there is a differential input pair (Q1, Q2). Its collector currents 
are mirrored separately by Q3-Q5 and Q6-Q8. One of the mirrored currents 


' +sv goes directly to the output, the second 
me oo» Kari one is mirrored again (by Q9-Q12) and 
| —) then opposes the first one at the output. 
aro» Kare No matter what value is chosen for the 
il ‘ i operating current (Igain), the two 
Lk ar a2] Output currents at the output have the same 
iit value (without an input signal) and the 
,_flead_ = output voltage is at ground. 
- Ignore Rload for a minute. As 


= we have seen in chapter 4 a bipolar 
transistor has an emitter resistance 


k kK 7 
Q4 me 


Q 
ae 
lr, — 
a3 as a8 q* Ie 
fo 


where k and q are constants, T is the 

i [Usus Key : : ‘ 
temperature in Kelvin and Ie is the 
emitter current. The term for r, is 
dynamic emitter resistance (often called 
"little re") because it changes with 
emitter current. At Ile = 1mA it amounts 
to roughly 26 Ohms (at room temperature), at 1O0uA 260 Ohms, at 10uA 
2.6kOhm, i.e. it is inversely proportional to Ie. (There is also a constant 
resistance in series with re, the physical resistance between the emitter 
contact and the base-emitter junction; this becomes significant at higher 
currents). 
The transconductance of a bipolar transistor is simply: 


Fig. 10-1: A simple bipolar 
transconductance amplifier. 
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Thus with an emitter current of 100uA the transconductance is 
(1/260) Ohms, i.e. a 1mV signal at the base cases a change in collector 
current of 3.8uA. In a differential stage the transconductance is half of that 
since there is an r. in each transistor (and we double the total current so that 
each emitter receives 100uA). 

In figure 10-1 the currents are mirrored in a ratio of 1:1 so that the 
collector currents of Q1 and Q2 appear unchanged at the output. With no 
signal at the input they cancel each other but, as one input is moved up or 
down, one current becomes larger and the other one smaller by the same 
amount. Thus the total transconductance is doubled and we have the same 
value as for a single transistor. 

Without some DC resistance at the output a transconductance 
amplifier is really quite impractical. Even the slightest mismatch in any of 
the transistors would slam the output voltage into one of the supply rails; we 
need some impedance like Rload to keep this voltage near the center. Rload 
converts the current output into a voltage output, which means that we no 
longer have a transconductance amplifier but simply a voltage amplifier 
(with a high output impedance to boot). Very few of the OTAs are actually 
used as transconductance amplifiers. 

With Rload back in the circuit the total voltage gain is now simply: 


RI 
A, = ee = (i } Te* Rload 
kT 


v 


r 
e 


So we have an amplifier whose gain can be varied (over a wide 
range) by varying a current . 

And herein lies the problem. The input signal also varies the 
current, and thus the gain changes with the amplitude of the signal. The 
result: distortion. With a small signal at the input this may be tolerable for 
some application, but not with a large signal. Here is the tally: 


Input Signal Igain=1uA | Igain=10uUA |Igain=100UA) 
Gain 
-5dB 14dB 32dB 
Distortion 
0.3% 0.2% 0.1% 
1.2% 0.9% 0.3% 
6.2% 5.1% 1.6% 
16% 15% 8% 
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So, if you have to handle a signal greater than about 20mVp, this 
circuit is a poor choice. You cannot use feedback or emitter resistors (for 
Q1 and Q2) to linearize it, it would interfere with the variable gain. 

There is another problem, not just for this circuit but for all such 
schemes: offset. A mismatch in not only the input stage but all three current 
mirrors will show up as an offset (and added distortion) at the output, 
increasing in magnitude as Igain increases. In this circuit this amounts to 
60mV worst-case at 100uA. Also, remember that bipolar transistors have 
input currents. 


There is help, though, and as usual you need to add a few more 
devices. If we — . 


+5V 


connect diodes to a3 as th 
the inputs and "Ql 
feed the signal in sour a4 a6 i ~ 
through a resistor, — Output 
we have os t 
something very aie Rload 

ae é Qi7 30k —l_ 
similar to a K K = 

rf In+ In- 
current mirror $——+—Kar oy 
(e.g. figure 3-1). a Re ea 
; oe 22k at Na {ars 

The input - 
impedance 1s low © lee | a 13 Q| ' a2} « {ara 
because of Q16 ah: 20u sua 20u 
(at 20uA re 7 “ * #_)sv 
amounts to Fig. 10-2: An improved version of the bipolar 
1.3kOhm at room transconductance amplifier with linearizing diodes. 


temperature) and 

R22 converts the 

input voltage into a current. A 500mVp input signal causes so little change 
in voltage at the base of Q1 that the distortion is down to 0.3% at 1uA and 
0.01% at 100uA. The offset voltage still persists, amounting to 60mV again 
worst case at 100uA. 

Both sides of the input pair need to be treated equally, including the 
addition of the dummy resistor R2 to avoid worse offset problems. The 
current mirrors used here are of the highest precision, sacrificing low 
operating voltage for accuracy. 

Q15 aids to remove the base currents for Q16 and Q17 from I1, but 
even with this measure the ratio between I1 and I2/I3 needs to be precise; 
any mismatch will increase the offset voltage and the input current (250nA 
max. with perfect matching). 
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The one great 
feature of a circuit like 
this is the precise control 
of gain over a wide range. 
Figure 10-3 shows the 4 
gain (in dB) vs. Igain. A 
linear change in current 
results in a logarithmic 
change of gain. Thus, for = 
audio applications, you 
can control the volume (a 
logarithmic function) 
with a linear current (or 
voltage). The accuracy is 
within +0.2dB. True to Fig. 10-3: Gainisa precise logarithmic function of 
the exponential nature of the Hain: 
base-emitter diode, the gain 
changes 20dB per decade of current. 

Because the diode-connected transistors (Q16, Q17) track the input 
transistors, gain is 
virtually unaffected by 

temperature. If Igain is 
ian pou derived from a resistor 
made from the same 
0 | layer as R1, R2 and 
Igain = 10UA i . 
tabinclA Rload, the gain is also 
unaffected by absolute 
variations. 
ae Figure 10-4 
shows the waveform that 
‘i appears at the output. 
There is a very large 
# change in the level of the 
: 7 . = ie signal, which is of course 
the purpose of the circuit. 
Fig. 10-4: Output waveforms (1kHz). Because of the offset 
voltage a 
"transconductance" amplifier is best suited for audio and filter applications 
with the output capacitively coupled to the next stage. 


Gain / dB 
i) 


100n 200n 400n dp 2u 4u 10u = 20u 40yu 100p 


gain / A 


Output / V 


Time/mSecs 200uSecs/div 
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The concept works in CMOS too, but the fundamentals are different. 
CMOS transistor naturally takes a voltage at the gate and delivers a 


A 
current at the drain, and this transconductance varies as the square of the 
operating current. 
. ' . ee Figure 10-5 is the 
oa | ns ce same configuration as figure 
cea we 1 Ko 10-1, with NPN transistors 
Meo | Meo wee | Mae replaced by N-channel 
devices and PNP transistors 
all | Nn all | thn by p-channel ones. The 
BS aes ee devices are quite large (M5, 
ero | Met0 mero | Met0 for example, has a total 
— r= Le width of 200um with the 
In 2 nloss multiplier M set at 10) and, 
| ef oe "30 __L_ as we will discover shortly, 
wl W-20u if im ~ the sizes chosen are still 
ee fat woz marginal. 
Mes Ma5 In this and the next 
ah a r example a dual power 
\ il C supply of + 1.5 Volts is 
W200 w.20t used. This may be an 
impractical value for you, 
: . 7 “the choice was made to 
simplify the discussion of 
Fig. 10-5: A CMOS equivalent of figure 10-1. input and output DC levels. 


In the real world you may be forced to use a single 3-Volt (or 3.3-Volt) 
supply, in which case the inputs and the output have to be biased at half the 
supply voltage. 

This circuit uses a 0.35um process, necessary because the high- 
accuracy current mirrors cannot tolerate an output voltage of less than about 
0.6 Volts across them. If you were to use a process with smaller 
dimensions you would have to reduce each mirror from four to two devices 
and pay the penalty of much reduced accuracy. 

A CMOS transconductance amplifier suffers from the same non- 
linearity as a bipolar one. Distortion is tolerable only for small input 
signals. With a +40mVp input it amounts to 0.1% at 100uA, 0.7% at 10uA 
and 1.4% at luA. When the signal is increased to +75mVp (which results 
in the maximum output swing possible, + 0.9V) the distortion increases to 


0.8% at 100uUA, 2.3% at 10UA and 4.5% at 1uA. 
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In the gain vs. operating current plot (figure 10-6) the range is 
extended down to InA 
just to show the wide 
range achievable. You 
notice, however, that at 
the high-end the circuit 
deviates from a pure A 
logarithmic behavior; to 
straighten this line the 
transistors would have to 
be even larger. 


Gain / db 
ie) 


Transconductance 6 
of an MOS transistor is i 
temperature dependent in 2n 4n 10n20n40n 100n 400n tp 2u 4y 10u20u40n 100 
and thus there is a Igain / A 
decrease of gain (at any Fig. 10-6: Gain (in dB) vs. Igain. 


current) of about 2dB from 

0 to 100°C. Also, a CMOS transconductance amplifier has the same offset 
problem as a bipolar one; at 100uA this amounts to + 30mV. Unlike the 
bipolar version, however, this circuit has no DC input current. 


A rather complex scheme has been developed to linearize the input 
stage and still have gain control (see references). To do this 15 more 
transistors are needed (M15 to M29, figure 10-7). 

M17 is the key device. This "diode-connected" transistor is the 
same size as M1 and M2, the input differential pair, and all three devices 
share a mirrored [gain current (M23) at their sources. The drain/base node 
of M17 receives the same amount of current from M24 through the current 
mirror M26 to M29. M25, a cascode transistor, has been added to improve 
the matching of the mirrored currents. 

The current into the drain/base node of M17 is also shared by M16 
and M18, but their current is governed by the fact that they each are part of 
another differential pair (M16/M19 and M18/M15) whose operating 
currents are set at (3/4)Igain. M15 and M19 are twice as wide as the other 
five devices in the input row. 

This complicated use of ratios serves to extend the range of input 
voltage over which the differential pair is linear. The optimum is reached 
with a ratio (between M15/M18 and M19/M16) of 2.155, exceeding + 1 
Volt. For our case here this is of little consequence, 75mV causes an output 
swing of + 0.9 Volts, the maximum the circuit can handle. 
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At this level the distortion now amounts to 


Chapter 10: Transconductance Amplifiers 


0.1% at 50uA, 0.2% at 


5uA and 0.3% at 0.5uA. 
= = 7" = a iF +1.5V 
M26 M27! M4 M8 
“ines P P 
cee | ee er ‘a 
M=2 M=2!) M=2 M=2 
Igain 
M28 M29) 
8 cae 4 mig 
rt 
‘ee w=20) | | 
=2u L=2 W=20 W=20) 
=10 M=1 (at iat 
a M=1 M=1 
* = __|Output 
| M1 M15) M16 M17 M18 a] M2 n+ 
| | | | | 
eT ibe ae = , 
es) Sew aie es. eae eae late Fad 
M=2! M=4| M=2 M=2 M=2 M=4 M=2 ip 80k 
7 M11 M12) 
r = | - | 
a 
M-5 M=5 
* W=10u 
M20 M21 M22 M23 M24) reins M13 M14) 
| | | | | ‘ || | 
| 
ay oa fa te 
M=4 M=3 M=3 M=4 M=4 
- = = = _—/ = a 1.5V 
Fig. 10-7: Transconductance amplifier with linearized input. 
With the device 
2 dimensions shown the 
circuit cannot handle 
1 
much more than 50uA 
(Igain) and starts 
deviating slightly from 
$4 the ideal logarithmic 
5 : 7 line above 20uA. 
Because of the many 
3 | additional devices there 
is also a slight 
-4 ey ce 
deviation below about 
10nA. Even so, the 
10n 20n 40n 100n200n400n itp 2u 44 10p 20u 40py gain control has a range 
gain /A 


Fig. 10-8: Gain (in dB) vs. operating current (Igain). 
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of more than 70dB. 
Note that the 
output impedance is 
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rather high; a buffer may be needed. 

The problem with the offset voltage is still present, only slightly 
reduced by the lower gain. Figure ona + 20mV uncertainty at the output. 
For this reason transconductance amplifiers are primarily used in audio and 
filter applications where the output can be capacitively coupled. 
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11. Timers and Oscillators 


Summer of 1970. The economy was at the bottom of the cycle and 
Signetics, the promising young company I had joined just two years before, 
laid off half of its employees. 

Disgusted with the turn of events, I decided it was time to strike out 
on my own and rented space between two Chinese restaurants in downtown 
Sunnyvale, California. Signetics (now Philips) lent me the equipment I 
needed and gave me a one-year contract to develop a new IC. 

The idea for the new IC came from the work I did at Signetics on the 
phase-locked loop. I had needed an oscillator whose frequency could be set 
by an external resistor and a capacitor and was not affected by changes in 
either supply voltage or temperature. Several products resulted from the 
basic design, among them the NE566 Voltage-Controlled Oscillator. 

The oscillator 
contained first of all a 
voltage-to-current 
converter. The reference 
voltage at the positive 
input terminal of the op- 
amp is not regulated, it is 
simply a fraction of the 
supply voltage. Feedback 
to the op-amp keeps the 
voltage across the 
external resistor at the 
same level and thus the 
current through the 
resistor becomes 


5/6 Vcc 


. . ; (1/6* Vcc)/Rext. 
Fig. 11-1: The basic 566 Oscillator. In the actual D A éh 
circuit the comparators and the flip flop are combined “penis Vat 
in one Schmitt trigger. state of the switch 
controlled by the flip- 


flop, the external capacitor is either charged with the current, or discharged 
with a current of the same magnitude through a 1:1 current mirror. 
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There is a divider with three identical resistors, producing 1/3 Vcc 
and 2/3 Vcc at the two taps. Two comparators, referenced to these taps, 
watch the voltage across the external capacitor. If it moves above 2/3 Vcc, 
comparator | sets the flip-flop, the switch diverts the current to the mirror 
and the capacitor is discharged. When the voltage across the capacitor 
reaches 1/3 Vcc, comparator 2 resets the flip flop and the capacitor is 
charged again. 

This endless cycle 
produces a triangle-wave. The 
amplitude is dependent on the 
supply voltage, but so are the 
charge and discharge currents 
and the two effects cancel each > 
other. Except for small errors 
inside the IC, such as the offset 
voltages of the op-amp and 
comparators and the matching 
in the current mirror, the 


frequency is exactly: Time/mSecs 2mSecs/div 
Fig. 11-2: Triangle-wave produced by 
1 the 566 oscillator. 
f= 3% REC 


What I proposed to Signetics was this circuit, modified so it could 
also be triggered and produce a single cycle only, i.e. it would be both an 
oscillator and a timer. 

The project almost didn't get off the ground; the engineering staff 
didn't think much of the idea. Timers at the time were put together from an 
op-amp or comparator and a few discrete components, including a Zener 
diode or two. They argued that such a design would cut into the sales of 
their present ICs. But the marketing manager, Art Fury, over-rode them; a 
man with immense practical experience, he simply had the gut feeling that 
such a timer would sell. 

It was a one-year contract and designing the circuit took half of that. 
No computer analysis then, the circuit had to be laboriously breadboarded. 
When everything was working I wrote a development report and gave a 
design review at Signetics. The design passed without any comments. 

But something wasn't quite right. I felt that I had missed something, 
that I could do better. It bothered me that the design required nine pins, 
which was about the most unfortunate number I could have picked. There 
was an 8-pin package; the next higher number was 14. 
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I started on the layout. In 1971 this meant sitting at a drawing board 
for several weeks, fitting devices together into the minimum rectangular 
shape and checking each dimension by hand. Then, about two weeks after 
the design review, on the way home after work, it suddenly hit me: what 
would happen if I got rid of the voltage-to-current converter and charged 
(and discharged) the capacitor directly with a resistor? That would bring 
the pin-count down to eight. 

I made a U-turn, went back to work and tried it. Sure enough, the 
timing didn't change as I varied the supply voltage. It was my own 
limitation that had made me assume that only a Jinear relationship between 
charge current and end-voltage would cause the cancellation effect. Even 
though the charging of a capacitor through a resistor causes an exponential 
rise of the voltage, the cancellation was just as effective. In fact, having 
eliminated the voltage-to-current converter, I now had not only a smaller 
but also a more accurate circuit. 

I made the changes in the circuit but didn't bother to request a 
second design review. I only told Art Fury, who was pleased; an 8-pin 
package was then significantly less expensive than a 14-pin one. 

It took another five months to draw the layout, cut the patterns on 
Rubylith (by hand), spend endless hours hunched over a light-table to check 
dimensions and connections (again: by hand, no computers), make a mask 
and a prototype wafer and then evaluate the IC, which Art Fury decided to 
call the NES55. 

In the meantime, one of my former colleagues left Signetics to join a 
start-up. The first circuit this start-up brought to market was the timer I had 
described in my design review. Time-wise they beat Signetics by two 
months, but when the real 555 came out, they had to withdraw their version 
very quickly. 

The market reaction to the 555 timer was truly amazing. Art Fury 
made history by bringing out the circuit at an unprecedented low price, 75 
cents. I had deliberately made the design flexible, but nine out of ten 
applications were in areas and ways I had never contemplated. For months 
I was inundated by phone calls from engineers who had a new idea for 
using the timer. To this day the 555 has been the best-selling IC every year, 
copied by numerous companies. Except for a CMOS version, the design 
has never been changed. 

Looking at the design now, 33 years later, there are many areas 
where it can be improved with the design techniques we have learned since 
and with the enormous benefit of computer simulation. So, let's look at the 
actual 555 timer and then a version which benefits from 33 years of 
progress. 
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Fig. 11-3 The original 555 timer. 


Both comparators use Darlington input stages. This makes the timer 
fairly slow, but allows an extreme range of external resistance. Comparator 
1 consists of Q1 to Q8. The four PNP transistors form a current mirror with 
gain, provided by the unequal emitter resistors. 

The output of this comparator feeds into a 4.7kOhm resistor (R11), 
which is part of the cross-connection in the flip-flop (Q16, Q1I7). 
Comparator 2 (Q10 to Q15) resets the flip-flop. 

The output stage, which must be able to sink or source some 
200mA, is controlled by Q20. In the high state the Darlington pair 
Q21/Q22 delivers the current, but at a cost of a voltage drop of about 2 
Volts. In the low state Q24 receives sufficient base current to work alone 
up to about 50mA; beyond that, as the voltage drop increases, Q23 feeds 
extra current into the base circuit. 

There are several flaws in this design, indicative of the early period 
of IC design (and the inexperience of a rookie designer). Neither 
comparator is well balanced, showing offsets of as much as 30mV. The 
circuit can get away with that because the voltage swing is quite large. 

The operating currents are quite large; the lateral PNP transistors run 
at up to 1!mA. That was acceptable at the time since the devices had 10um 
geometries; today it would be excessive. 
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The output stage consumes a considerable amount of current in the 
low state and, during switching, both output transistors are on for a brief 
period of time, producing a current spike in the supply. 


R1 I 1 utput 
91k Veco q 4 
| Threshold FM | (7) vec RC 
555 12 
» | Discharge Output > 
R2 4+ 
Trigger Reset |— tan 
Trigger | 
fefe) Gnd Trigger] 
C1 
In 20 40 60 80 100 120 140 160 180 


= = Time/jiSecs 20uSecs/div 


Fig. 11-4: Timer connection of the 555. Fig. 11-5: Timer waveforms. 


In the timer connection the period starts with a negative-going 
trigger pulse, which resets the flip-flop through comparator 2 and moves the 
output high. When the voltage across C1 reaches 2/3 Vcc, comparator 1 
sets the flip-flop, C1 is rapidly discharged and the output moves low. 
Despite the bad offset voltage the accuracy is quite remarkable: the error in 


timing is around 1% with a temperature coefficient of 24ppm/°C. The 
timing formula is: 


t = 11* RI*Cl 
12 
Ri Veo 1” 
50k R3 
Threshold ru l_ ¢100 C VM 8 
. 555 - : 
= Discharge Output . 
50k _—— — 4 
Trigger Reset |— 
Gnd 
=n 
C1 


—— 120 140 160 180 200 220 240 260 280 300 320 340 


Time/uSecs 20uSecs/div 


Fig. 11-6: Oscillator connection of the 555. Fig. 11-7: Oscillator waveforms. 


In the oscillator connection there are two external resistors and the 


voltage across C1 moves between 1/3 Vcc and 2/3 Vcc with a frequency 
and duty cycle of: 


14 R2 
f= ae DutyCycle = ——W— 
(R14+2R2)Cl R1+2R2 
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It is not quite possible to achieve a 50% duty-cycle; charging and 
discharging the timing capacitor through a resistor connected to the output 
is not such a good idea; the high and low voltage drops are unequal and 
have significant temperature coefficients. 

There is a CMOS version of the 555 and a redesign for operation 
from a single battery cell (see references), but the circuit is still being sold 
today in its original form, despite the fact that much better performance is 
possible with more modern design techniques. Here is my candidate: 


* 
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R1 Q4 Q31 
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Qi 7 “7 aQ14 . = 4 
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500, kar > K 
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tr 20k | Discharge 
on 
er Karz —#* 
34K [4 Re KO28 ae 
a2 a7 fer ard} 4kars 20k ase 
* * * — * — — 
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R13 
Q4t - 
Threshold 240» Vv Q44 Kose 
+—Koso 
O45 Lf Q47 
= * 6 
R10 $R11 $Ri2 R14 Lee 
fa20 ao a 7.5k $7.5k $7.5k 7.5k 
Trigger I —Kare aes . Kast 
Reset R 
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t+—Kozr 5 Kaas 
21 Q24 
Q Nl tosup ass ase 
Gnd aa & &- ss & e 8 
Comparator 2 Output Stage 


Fig. 11-8: An improved version of the 555 timer, 33 years after the original design. 


First off, the new timer gets a proper bias circuit (Q1 to Q5) to hold 
the operating currents more constant over the wide supply voltage range. 
This (and a few other steps) extends the operating voltage down to 3 Volts. 

Comparator | (Q6 to Q17) now has a balanced active load (Q15, 
Q16) which reduces the error in the timer mode to about 0.5% and the 
temperature drift to 3 ppm/°C without any loss in speed. The change in 
timing from 3 to 15 Volts is a mere 0.05%. 

There are two changes in comparator 2 (Q18 to Q27): a small 
operating current for the outer Darlington transistors, which greatly 
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improves switching speed, and a balanced active load, which makes the 
trigger level considerably more accurate. 

The flip-flop (Q28 to Q36) is a new design; it operates in a current- 
mode for maximum speed at the lowest possible current. The two 50uA 
currents generated by Q31 are split by a pair of lateral PNP transistors; one 
quarter of the current is fed into the base of the opposite flip-flop transistor, 
another quarter turns the reset transistor on and off and one half of the 
current is used to steer the output stage. The voltage swing at the collectors 
of the flip-flop transistors (Q30, Q36) is 2VBE. 

The most significant change is in the output stage. The base current 
for the lower output transistor (Q51) is no longer derived from a resistor. A 
small amount of current is injected into the bases of three transistors, forced 
to be equal by the three resistors R10, R11 and R12. This (plus an 
additional current delivered by Q45) starts a positive feedback loop formed 
by Q40, Q41 and Q42. Q40 is about seven times the size of Q41 and Q42 
has one emitter while the output transistor has 24. This loop then provides 
whatever current is needed to keep Q51 fully turned on. 

Positive feedback loops are always dangerous, they can run away or 
refuse to turn off. In this case the loop is contained by the collector 
resistance of Q43 and can be opened up by turning Q43 off. 

Replacing the Darlington configuration in the upper part of the 
output stage with a compound (PNP/NPN) transistor reduces the voltage 
drop. Base current for this part is provided by Q47. Q44, Q46 and Q49 aid 
in turning the power devices off rapidly and eliminate the large transient 
current. 

With these measures the current consumption is now down to 
0.85mA from 3mA (typical) at 5 Volts. At 15 Volts the circuit consumes 
1.2mA (down from 10mA). Minimum operating voltage is 2.5 Volts (-40°C 
to 100°C). 


Shortly after the 555 came out Intersil announced a CMOS version. 
It was (and still is) done in a 15-Volt process, which requires large 
dimensions and is inherently slow. The circuit is not directly compatible 
with the bipolar version, lacking high current outputs. 

Except for this weakness, CMOS is ideally suited for a timer: there 
is no input current and thus no need for Darlington stages. 

Figure 11-9 shows a design using a more modern 5-Volt (0.5um) 
process. The comparators are conventional (as discussed in chapter 9), with 
the dimensions of the devices chosen so that the threshold and trigger inputs 
can move rail to rail and their matching is adequate for precision operation 
(3ppm/°C). 
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Fig. 11-9: A 5-Volt CMOS version of the 555 timer. 


Ordinarily a flip-flop consists of two cross-coupled gates. In this 
case two cross-connected transistors fed by current sources result in smaller 
temperature and voltage drifts, because the flip-flop switch levels track the 
operating currents of the comparators. 

The operating currents are set by R1, which limits the operating 
voltage range in which high precision is obtained to 3 to 5 Volts. Replacing 
R1 with a current source extends this range down to | Volt. 

A CMOS output stage 
swings rail to rail (unlike a bipolar 
; aiescuane Sk, output which at the very least has a 

minimum drop of some 150mV, if 

| Discharge Output not an entire VBE). Thus the timing 
_ oo = resistor can be connected to the 

day Sea Heset output, resulting in a square-wave 
C1 with a precise 50% duty-cycle. On 
re hes the other hand, CMOS devices are 
inferior to bipolar ones when it 

Fig. 11-10: 50% duty cycle oscillator. comes to current handling. Even 
with a gate-width of 200um for the P-channel devices and 100um for the N- 
channel transistor, the circuit only delivers 10mA and the voltage drop is 
.25V (which badly affects the duty-cycle). It would be better to have 
separate outputs for the timing resistor and the load. 


Vec if 
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Fig. 11-11: A high-frequency triangle-wave generator. 


Figure 11-11 shows an oscillator which produces a precise triangle 
waveform even at relatively high frequencies. All transistors which 
determine speed are NPN and do not saturate. 

First the low-frequency part, the current sources used to charge and 
discharge the external capacitor. The primary current is produced by Rext; 
being connected between the positive supply and two VBEs the current 
through the resistor is not only dependent on the supply voltage but also has 
a temperature coefficient. Both of these effects are eliminated by using an 
internal resistor chain (R9, R10 and R11) connected the same way. 

The primary current is mirrored by Q6, Q7, Q9 and Q10 and then 
mirrored again (Q1 through Q5) to form the charge current. A second 
current of twice the magnitude is derived from the first current mirror by Q8 
and Q11. This latter current is used to discharge the capacitor and is turned 
on and off by the differential pair Q13/Q14. 

The internal resistor chain is used to bias the rest of the circuitry and 
provide the reference voltages for two comparators; the voltage across the 
three identical resistors is (Vcc - 2VBE). 
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Comparator | consists of a single differential pair (Q18,Q21) as 
does comparator 2 (Q23, Q25). They provide the operating current for the 
flip-flop (Q19, Q22) with two of their collectors while the other collectors 
switch the flip-flop's bases. The ratio of the resistors in this arrangement is 
key: the operating currents for the comparators is set by R12 and R13, 
which have one VBE across them. The two currents end up flowing 
together through either R5 or R6, depending on the state of the flip-flop. 

RS5 and R6 are one quarter the value of R12 and R13, so the voltage drop 
across them is one-half VBE. In other words, the collector voltages of the 
flip-flop drop 1/2 VBE below the base potential, which is safely above the 
saturation voltage. 

Now the question is: how do we get this small voltage fluctuation, 
located just below Vcc, to work on the bases of the differential pair 
Q13/Q14, which must operate in the voltage region below the low point of 
the wave-form? We cannot use lateral PNP transistors, they are far too 
slow. 

We do this by coupling the switching signal to the differential pair 
through two resistors (R3, R4) and running a known DC current through 
them. Q12 and Q1I5 are current mirrors, slaved to the bias chain (Q27). 
Their current thus increases as the supply is increased, and so do the voltage 
drops across R3 and R4. Thus the average potential at the bases of the 
differential pair stays at a fairly constant 1/3 Vcc, over an operating range 
from 9 to 15 Volts. 

The triangle wave-form 
across the capacitor is buffered 
8. by the emitter follower Q16 for 
use by both the comparators 
and an external load. At the 
unused collector of the 
- differential switching pair a 
square-wave can be obtained. 

This is not an oscillator 
of ultimate precision, but it 
delivers a good-quality wave- 
45005 is 2 25 8  formup to atleast 1MHz. The 

Time/uSecs soonsecsiv temperature coefficient is 
190ppm/°C and the change in 
frequency from 9 to 15V 
supply is 1.7%. As always, these results are based on one particular 
process; it is a good idea to re-simulate the design for the process you are 
using. 


7.5 


Fig. 11-12: Wave-form at 1MHz. 
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A triangle-wave can be Ey ‘ . A 
looked at as a sine-wave with i 
distortion. This is not so far- x on} 4 
fetched, because the distortion is > ie Se 
only about 12%. If we round pw. a3) + 
off the peaks, we end up with a a? aaa 12k 
fairly respectable sine-wave tw os) ; 
with relatively little effort. i ae fe) eps fe) zou 

Figure 11-13 shows a 
companion circuit to figure 11- aa, 
11. Itis inserted between points |. a jer | jez | jeer 
A and B, replacing R10. The ms ot 
triangle wave, entering through tv ap t 
R1, encounters attenuators at six on 124 
different voltage levels. Follow ry ie io 
a positive-going signal: at first = me om 
there is no attenuation; at about 206 a ee 
0.6 Volts R11 kicks in, held at -_ 
that level by Q6; at the next = : 8 
higher level R10 appears in Fig. 11-13: Shaping circuit, transforming a 


triangle-wave into a sine-wave. 


parallel to R11 and at the final 

level an even smaller resistor, R2, 

reduces the signal even further. 
Using only three clipping 


S levels in each direction, the 
a resulting sine-wave has a 
distortion of only 1%. Using more 
: levels reduces the distortion, but 
49 02 #04 06 08 1 12 414 16 1.8 2 is likely ” require trimming. 
Time/mSecs 200uSecs/div 


Oscillators (and timers) can 
often be very simple for non-critical 
applications. Suppose you wanted to create a brief pulse once a second, for 
example to flash an LED. The frequency or pulse-width need not be 
precise, i.e. the design does not require two sophisticated comparators; a 
simple Schmitt Trigger will do. 


Fig. 11-14: Triangle and sine-wave. 
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Figure 11-15 shows 
a bipolar design for such an 


az. 
LED flasher, intended for 
use at either 5V or 3.3V. ac #Litep 
Q3 and Q4 form a simple 


comparator, with Q5 as an 
active load. Rext charges 
Cext and when the voltage 
at the base of Q3 exceeds 


— Cext 


15k 


R7 
3k 


Enable 


R8 
45k 


R2 


RQ 


that at the base of Q4, Q6, 
Q2 and Q7 turn on. 
Through the voltage divider 


TO suB 


30k 


1.5k 


R10 
100k 


(pinched) 


as} 


R6/R7/R8 Q7 now abruptly 
lowers the potential at the 
base of Q4, while Q2 


Fig. 11-15: Circuit generating brief current pulses. 


discharges Cext through RI. When the voltage across the capacitor drops to 


the new level at the base of Q4, Q6 turns off 


and the cycle starts anew. 


The frequency is set by Rext and Cext (here 1 Hz) and the pulse- 
width by R1 and Cext (20msec). The frequency is accurate to within 2% 
from 3 to 5.5 Volts (not counting the variation of the external components), 


|(LED-anode) / mA 
RC/V 


LED Current 


12°14 16 18 2 22 24 26 28 


Time/Secs 200mSecs/div 


Fig. 11-16: Wave-forms of the pulse generator. 


but the pulse-width reflects the 
variation of R1, a diffused 
resistor. 

LEDs have a rather 
large forward voltage drop 
(about 2 Volts), so a supply 
voltage of at least 2.5 Volts is 
required. The current through 
the LED (40mA) is primarily 
determined by the size of Q1; 
it operates in the high-current 
region, where the gain has 
already decreased but the 
spread of hFE becomes 


narrow. Average current consumption is 1mA. 
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. pT Usev It is interesting to 
2 ee aes consider a CMOS design 
to. 36u for the same function. 
Me ml ‘| We can avoid using a 
ree 4 resistive divider (and 
140.84 [Lad <te0 thus save current) by 


using two comparators 
and making two of the 


He wa i ae pl 4 LeD transistors in each 
Lo Ra i: comparator nine times 
aie 7 wit the size of the others. 
it ——1 re 03e4 This results in an offset 


of some 200mV. The 
reference potentials for 


—L 330n 


“pout gai | Me , the comparators are the 
fey went supply lines. As the 
iar a voltage across the 
— s : oe >< external capacitor rises to 
200mV below the 


Fig. 11-17: CMOS design for a pulse generator. Pome UEP ERS 
comparator (M1 to M4) sets the flip- 
flop (M12 to M15); as it falls to 
200mV above ground the lower 
comparator resets it. Cext is charged 
by Rext (1 second) and discharged by 
R1 (20msec). The two times are 
surprisingly accurate, exhibiting a 3% 
change from 3 to 3.6 Volts and 0 to 


RC/V 


100°C. : | _ Grgund + oom 

The output current, on the other ig 232 24 26 28 3 32 34 
hand, shows the weakness of CMOS: Time'Sees 200mSecsidiv 
it varies + 21% with a supply voltage Fig. 11-18: Waveform at RC of 
change of + 10%. For this reason an figure 11-17. 


even large output transistor (M16) and a 
resistor in series with the LED may have to be used. 

With a 20msec pulse of 37mA every second, the entire circuit 
consumes just 650uA average. 
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A second example, a timer this time, shows just how low a power 
dissipation can be achieved if a resistor divider is avoided: the circuit in 
figure 11-19 draws 1uA at 1.8 Volts. 

The entire circuit (save the output inverter) is powered from the 
Start pulse, rising from ground to Vdd. The logic input must stay high 
longer than the set 


Seat ai “| be 4 ““ timing. Alternately, if 
aly | | the Start terminal is 
W-=5.2u W=5.2u at Mio 5 
my pote rs X L=0.18u| L-0.1gu it simply connected to Vdd, 
10Meg | 2 1, ‘4 the circuit becomes a 
a Zz 5 
” | Lou || start-up timer. 
RC M ve | Ws 
ie ie L-0.18u M1 to M6 form a 
as ' a ++ es comparator. By making 
Lou 1 Lory) UN | We MI ten times as wide as 
M13} a 
. tei re M2, an 80mV offset is 
To ee ose bette, created. The gate of M2 
pa is connected at Vdd, thus 


—- = = the comparator switches 
at 80mV below Vdd. 
The switching action is 


Fig. 11-19: A 1.8-Volt timer which consumes just 1uUA. 


enhanced by employing a small 
amount of positive feedback; M4 
and MS are slightly wider than 
M3 and M6 and deliver their 
drain currents to the opposite 
sides (see also figure 9-5) > 
The operating current for 
the comparator is provided by 
M7, a long and thin transistor. 
The advantage of such a device is 
size: it produces about 0.6uA 


0 20 40 60 80 100 120 140 160 180 200 220 


using a relatively small area; a as eerore 
resistor doing the same job Fig. 11-20: Switching threshold of the 1.8V 
(1.2MOhms) would be painfully timer. 


large. 

On the other hand an MOS transistor with its gate connected to the 
supply is hardly a constant current source. With a 25% change in supply 
(1.6 to 2 Volts) the current changes 70% (0.5 to 0.85uA). Here we can 
afford to live with this shortcoming. 

The two outputs of the comparator are level-shifted by M8 and M10 
and the active load M9/M11 to fit the input of an inverter. 
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A circuit operating at such a low current is of course quite slow; a 
significant timing error occurs below about 50usec. At lower speeds you 
can expect an accuracy of better than + 5% from 1.6 to 2 Volts and 0 to 
100°C. 


Simulation of Oscillators 


You have just drawn up a great idea for an oscillator and start the 
simulation. Nothing happens, you get nothing but DC levels. 

This situation is all too common. The simulator is trained to first 
find an operating point, i.e. set the DC voltages and currents so all the 
device equations are satisfied (in simulator-speak: find convergence). Then 
the transient analysis starts and the computer finds that all the voltages and 
currents remain unchanged over time. 

In real life the circuit may start at exactly the same point, but no 
voltage or current stays unchanged, there is noise. No matter how tiny these 
fluctuations are, they move the circuit to a slightly different operating point 
and it becomes apparent that movement in one direction is the path to be 
followed. Thus, gradually, the oscillation builds momentum. 

Without any noise (or some sort of transient disturbance) no circuit 
would oscillate; it would just sit there, precariously balanced. 

It is of great advantage to have a simulator which allows real-time 
noise (i.e. all currents and resistors actually produce the appropriate amount 
of noise not only for a (small-signal) ac simulation, but during a transient 
analysis as well). With this feature a properly designed oscillator will 
always start. If you are using a simulator which has no real-time noise, you 
may have to coax the oscillator by jarring it with a pulse, e.g. step the 
supply voltage abruptly to a higher level. 

But there is a second potential problem: it may take a long time for 
the oscillation to build up. For the circuits discussed so far in this chapter 
this is no great worry, but for the type of circuits we are about to encounter 
this can be very frustrating. 

Take a crystal oscillator, for example. A high-quality crystal can 
take up to a second to start oscillating. At 1OMHz, that amounts to 10 
million cycles the simulator has to go through, in very small time steps to 
catch any movement. If you are not aware of this nuisance, you may sit 
there watching flat lines and come to the conclusion that your oscillator 
does not work. 
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Oscillators using inductors are rarely used in integrated circuits, 
except at frequencies above GHz. But there are occasions at lower 
frequencies when only an inductor can give you the performance required. 


One such example is shown in figure 11-21. 
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_fosus 


Fig. 11-21: Sine-wave oscillator with large 
amplitude. 


transformer with two windings for 
the simulation. 

The collectors of the 
oscillating transistors start at the 
supply voltage, 10 Volts. After a 
few hundred cycles of gradually 
increasing amplitude the waveform = 
at each collector is limited by the 
emitters of Q5 and Q6 at the 
negative end; at this point the peak- 
to-peak amplitude has reached 21 
Volts, more than twice the supply 
voltage. The action of the center- 
tapped inductor is that of a see-saw: 
one end dips to ground (or slightly 
below) while the other peaks at a little 
above 20 Volts. 
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The object of this 
design is the creation of a 
10MHz sine-wave with a 
large amplitude. Q8 and 
Q11 are the oscillating 
transistors, cross-coupled 
by the collector-base 
capacitances of Q9 and 
Q10 (about 8pF each). 

A small current 
(about 12uA) is injected 
into the bases of Q8 and 
Q11 to bring them into a 


current level at which there 


is sufficient gain. TX1 is 
in reality a center-tapped 
inductor, shown as a 


+10V 


Time/nSecs 


550 600 650 700 750 


Fig. 11-22: The voltage swing of the 
oscillator extends to twice the supply 


voltage. 
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Though the supply voltage is 10 Volts, the four transistors Q8 to 
Q11 must have a voltage capability of 21 Volts. 

To minimize the output capacitance, each oscillating transistor can 
share a collector region with its cross-coupling capacitor. 


Crystal Oscillators 
Let's start with the circuit commonly used in CMOS: the crystal is 


connected between the input and the output of an inverter. Since an inverter 
is ill-equipped to remain in a state 


between low and high, R1 is * a ]s3av 
“oe . M1 M3 M5 

employed to force it into the linear 1 ( i i 1 i 

region, at least initially. Cl and C2 Wea ezi) Wel 


are a mystery to most designers; 
they are there because the crystal 
manufacturer specifies them. 7: U | 
The whole arrangement is a tu) ty Wet) 
bit curious. An oscillator needs to =o 
have positive feedback, yet the 
phase-shift between the input and 
the output of an inverter is 180 s Cnstal Lg 
degrees - negative feedback. To 
understand this we need to look at or 0 
the crystal itself. a = 
A crystal is simply a sliver 
of quartz that vibrates. Quartz is Fig. 11-23: CMOS crystal 
piezoelectric, i.e. a voltage applied oscillator. 
between two surfaces makes it flex and flexing it creates a voltage between 
its surfaces. 


The vibrating mass of the crystal ot Ri 
can be represented as a series-resonant LC i 30 
circuit (C1, L1) with a series resistance | : —_— 
R1. The Q (originally the quality factor) x 


of such an LC circuit is given by: 
Fig. 11-24: Model for a crystal. 


= 2*m* f *L, 
R, 


and the resonant frequency by: 
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1 


ve 2*1* ./L,* C, 


The values in figure 11-24 were chosen to give a series-resonant frequency 
of exactly 1OMHz and a Q of 20,000. The Qs of crystals range from 10,000 
to 2 million, i.e. far higher than those of LC circuits. Ceramic resonators, 
which are otherwise almost identical to crystals, have considerably lower 
Qs. 

C2 is the stray capacitance created by the contacts to the crystal and 
the wires and pins of the package. 

If we open the feedback loop in the circuit of figure 11-23 (as shown 
in figure 6-14) we can see what is happening. There are in fact two 
resonances, about 0.2% apart. The lower one is the series resonance, the 
upper one parallel resonance. At these two frequencies the phase shifts 
abruptly between 180 and zero degrees. 


The parallel 
. a resonance is created by Cl, 
C2 and the combination of 
4 . 
160 external capacitances. In an 
‘a 3 oscillator properly designed 
- i Gain for series resonance it is of 
8 . no concern. 
> 100 s 1 
F 801 & +3.3V 
ao 
ms Phase aft 
40 f 
4 W=2 
20 L+0.35u 
10 10.005 10.01 10.015 10.02 10.025 10.03 10.035 10.04 7 M2 1 
Frequency/MHertz 5kHertz/div 
R2 
Figure 11-25: Series and parallel resonance of a a. ae 
crystal. eae 
Ri 
But even the series resonance is 1Meg 


influenced by the additional capacitances. As 
you notice from the plot it is not exactly 1OMHz. 
There are two reasons: |. At resonance the o. Pee 
impedance of a series resonant LC circuit 10p | ie 
becomes very low, limited only by R1. Thus it a 
works best if the input impedance of the inverter _Fig. 1-26: Improved 


is low. In figure 11-23 all we have for input CMOS crystal 
oscillator. 


we! Crystal Lg 
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impedance is a 1OpF capacitance and the gate capacitances. Thus Cl sees 


itself in series with this capacitance 
and the effective capacitance is 
slightly smaller. 2. At resonance the 
phase moves from 180 degrees to 
zero, but it doesn't actually reach 
zero (the condition required for 
oscillation) until about 10.015MHz. 
If we shift the phase in the 
feedback loop with an additional 
resistor (working against C2) a zero 
degree phase-shift is reached at the 
series-resonant frequency. With R2 
the frequency is more accurate and 
the chances of the crystal operating at 
some unwanted frequency (including 
harmonics) are diminished. But be 
aware that R2 decreases the loop gain; 
make sure it safely exceeds unity. 


M1-drain/ V 


3.5 4 


0 0.5 


Time/mSecs 500uSecs/div 


Figure 11-28: Start-up of a crystal 
oscillator. 


Insec), i.e. you have to wait for 
30,000 cycles before you can see 
the actual wave-form. 


Figure 11-29 shows a 
different approach, for a bipolar 
process. Gain and a 180 degree 
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Figure 11-27: The insertion of R2 in 
figure 11-26 brings the phase shift to 
zero degrees at the series resonance. 


Simulating a crystal 
oscillator can be a frustrating task. 
The higher the Q, the longer it takes 
for the oscillation to build up. In the 
case of figure 11-26 it takes 3msec 
for the oscillation to reach full 
amplitude. If you want to measure 
the frequency accurately, you need 
to do this in very fine steps (say 


+5V 


Crystal 


Sine 


Q2 Q5 


To sus 


Fig. 11-29: Alternate crystal oscillator with 


bipolar devices. 
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phase-shift is obtained through Q4, which needs to run at a fairly substantial 
current (about 1mA for the differential pair). The base of Q4 is biased at 
2VBE, which gives sufficient voltage swing without saturating the 
transistor. 

Q3, on the other hand, saturates. By making its collector resistor 
larger than that of Q4, a pulse (square-wave) output is obtained, swinging 
between 0.5 and 5 Volts. 
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12 The Phase-Locked Loop 


The idea of the phase-locked loop surfaced as early as 1932, but 
remained an esoteric and expensive concept until the arrival of the 
integrated circuit. The PLL has the unique ability to capture a signal 
without requiring precision components, a welcome feature in a world of 
large variations. 

The phase-locked loop is primarily an analog concept and only 
when we treat it as such can we fathom its powerful capabilities. And here 
is where simulation fails us. Anyone who has ever sat down at a bench and 
observed a phase-locked loop grabbing a minute signal seemingly buried in 
noise will agree that a simulation cannot give you the same sensation. Ina 
real circuit, a phase-locked loop almost seems to be alive, capturing and 
hanging on to a signal as the frequency is changed; a simulation of the same 
circuit is a cold experience, giving you no intuition (and taking up an 
enormous amount of time). 

To get a feel for the operation of a phase-locked loop you need to 
understand the key component, the phase detector. So, let's begin with this. 
a The circuit in figure 12-1 is known 
a le as the four-quadrant multiplier, one of 
| 2 several schemes used as phase-detectors. 
Output There is a straight-forward 
differential pair, M5 and M6, which 
: amplifies a signal arriving from outside the 
a ly IC. But instead of the drains being 


Www | wil»—Wl veo connected to load resistors or an active 
load, their currents pass through four other 
| [icf transistors, which are turned on and off by 
Ln a local voltage-controlled oscillator 
(VCO). 
fey The signal inputs are at a certain 
DC bias level, say 1V or 1.5V, high 
enough to exceed the threshold voltages. 
Now imagine a square-wave at the 


Fig. 12-1: The phase detector. 
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VCO input with a frequency 
exactly the same as that of the 
input signal. This could be a rail- 
to-rail 2-phase square-wave or, 
preferably, a wave-form moving + 
200mV differentially, centered at 
about 2V DC. At first (figure 12- 
2) the VCO wave-form is in-phase 
with the input signal, i.e. the two 
cross zero at the same time. 
During the first phase of the 
VCO signal, the drain of M5 is 
connected to R1 (through M1) and 
the drain of M6 to R2 (through 
M4). During the second phase this 
connection reverses: The drain 


Diff Probe / mV 
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2 Miele) 


7 0 0.2 0.4 0.6 0.8 1 1.2 1.4 


Time/uSecs 200nSecs/div 


Fig. 12-2: With the VCO signal in-phase, 
the output is a rectified sine-wave with a 
positive average. 


current of M5 flows through M2 and R2 and that of M6 through M3 and 
R1. Thus, ignoring C1 for now, the output across the two load resistors is a 


Diff Probe / mV 


Time/uSecs 


Fig. 12-3: With the phase of the VCO 


signal shifted by 90 degrees, the average 


of the output voltage is zero. 


200nSecs/div 


rectified sine-wave with a positive 
average value. 

Now, let's keep the 
frequencies the same but shift the 
phase of the VCO signal by 90 
degrees (figure 12-3). The signal 
is now chopped at the moment it 
reaches its peak amplitude and the 
output shows equal positive and 
negative excursions. Thus the 
average differential output voltage 
is zero. 

If we shift the VCO wave- 
form a further 90 degrees (still 
keeping its frequency constant), 
the VCO wave-form chops the 


signal at the zero-crossings again, but now the output is inverted. 
Averaging it with Cl results in a negative voltage. 

Thus, with C1 back in the circuit, we have a DC (or low-frequency) 
signal at the output of the phase detector which is a measure of the relative 
phases of the two frequencies. If we use this "error" signal to adjust the 
frequency of the voltage-controlled oscillator, we have the phase-locked 


loop (figure 12-5). 
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Here is what will happen 
in slow-motion: Let's say the 
VCO is running at 1MHz and the 
input signal is some distance 
away in frequency, e.g. 800kHz. 
Since the two frequencies are not 
synchronized, there is no phase 
relationship yet. At this point the 
phase detector is merely a mixer, 
producing several new 
frequencies, such as the 
difference between the two 
frequencies and various 
combinations of harmonics. The 


Signal | Phase Low-Pass 
Detector Fitter 
+ 
vco 


Fig. 12-5: Block diagram of a simple phase- 


locked loop. 


AC, but the VCO starts to jitter 
around its free-running 
frequency. Move the input signal 
just a little higher in frequency 
and suddenly the jitter disappears 
and the VCO jumps into step 
with the input signal. Now we 
see a DC level at the output of 
the filter. 

As you continue to move 
the input signal higher in 
frequency, the VCO will 
continue to track it, until the loop 
finally runs out of control voltage 
This is illustrated in figure 12-6 


FM 


> 
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Fig. 12-4: At 180 degree phase shift 
the average of the output is negative. 


one of interest is the difference, 
200kHz; it is still too high to pass 
through the filter. 

As we move the input 
signal gradually higher in 
frequency, there comes a point 
where the difference frequency is 
low enough so that some of the 
signal passes through the filter and 
starts influencing the VCO. The 
signal is not rectified yet, it is still 


B of 


Time/mSecs 500pSecs/div 


Fig. 12-6: Locking behavior of a phase- 
locked loop. 


where signal frequency is swept from low to high over a Smsec period. 
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The exact same behavior is seen as you approach the VCO 
frequency from the high end. The capture range (the maximum difference 
between the two frequencies to achieve lock-on) is always narrower than 
the lock range (how far you can drag the VCO and still keep lock). It makes 
no difference which frequency is moved and which is fixed. 

Both the capture range and the lock range are influenced by loop 
gain and signal level. If you increase the gain of the loop or the input signal 
level, both ranges become wider. 

(Strictly speaking the name phase-locked loop is a misnomer. As 
you move through the lock range, the two frequencies are locked but their 
phase relationship has to change to produce the error signal, i.e. the phase is 
not locked. Frequency-locked loop would be a better name). 


The phase-locked loop has three important features, especially for 
integrated circuits: 


1. Apart from the loop gain, the capture range is determined by a 
single low pass filter. For example, if the signal you are looking for is at 
50Mhz and has a narrow band-width (say 5kHz), you dimension the low- 
pass filter so it rolls off at about 5kHz. This makes the phase-locked loop 
look like a very sharp band-pass filter. A single-pole low-pass filter rolls 
off at 20dB per decade, so at 49.9MHz and 50.1MHz the interfering signal 
is attenuated at the low-pass filter by 26db. Using an active filter, it would 
take many poles and a large number of precision components to achieve the 
same selectivity. 

The phase-locked loop depicted in figure 12-5 is a second order PLL 
(i.e. it has two poles, one by the VCO itself, the other by the low-pass 
filter). This configuration is unconditionally stable. Adding another pole 
makes stability (i.e. the absence of unwanted oscillation) more difficult to 
achieve, but it doubles the sharpness of the filter action. 


2. The VCO need not be highly accurate. As long as the free- 
running frequency is within the capture range of the signal, the loop will 
find the exact frequency. 

This advantage, however, is made a bit difficult if your capture 
range is very narrow. In the example above the free-running frequency 
would have to be within 5kHz of the signal, i.e. 0.1%. Without using 
accurate components, such precision can only be achieved by tuning, for 
example by sweeping the VCO over a wider range, detect capture (see 
below) and then stop the sweep. 
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3. The error signal (i.e. the output of the low-pass filter) is a measure of 
frequency deviation. If the input signal is frequency modulated, this output 
is the demodulated signal. 


There is even a simple solution if the modulation is AM, not FM 
(figure 12-7). In this approach the VCO has a second output (same 
frequency but shifted 90 degrees) and a second phase detector and low-pass 
filter are added. 


Signal | phase po In the middle of the lock range, 
| *| Detector |“) Filter * the phase shift between the signal and 
the VCO is automatically 90 degrees, 
er Ss so that the control voltage for the 
~ a VCO is zero. This means that the 


signal frequency is chopped at the 
amplitude peaks. A second phase 

| AM detector operating at zero degrees 
Detector | Filter a phase shift will, therefore, chop the 

same signal at the zero crossing and 
the result is a voltage proportional to 
Fig. 12-7: Phase-locked loop with amplitude. This output then delivers 
AM output. not only the demodulated AM but also 

indicates that the loop is locked. 


How do you design a voltage-controlled oscillator? We have seen 
some examples in chapter 11, but for most applications VCOs for phase- 
locked loops are specialized for high-frequency operation. We have two 
examples here. 
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Output 
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Fig. 12-8: Current-controlled oscillator for operation between 6MHz and 300MHz. 
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The first example is a current-controlled oscillator, though it is a 
simple matter to convert the current into a voltage. 

II sets the operating currents for five simple differential amplifiers 
with active loads. The outputs of each amplifier are connected to the inputs 
of the next stage and the outputs of the last stage back to the inputs of the 
first. It is a ring oscillator, relying on the delay caused in each stage. This 
delay is dependent on the operating current, increasing as the operating 
current is decreased. With I1=10uA the delay in each stage amounts to 
1.6nsec and the frequency is 300Mhz. With I1=0.1luA the delay increases 
to 83nsec and the frequency decreases to 6MHz. A remarkably large range. 

But the delay in each stage is caused by a number of effects, each 
with its own temperature coefficient. The net result is a variation in 
temperature coefficient from +800ppm/°C at 0.1luA to +200ppm/°C at 
10uA. This temperature coefficient can be partially compensated by 
introducing an opposite tempco into the voltage to current converter, most 
likely optimized at one operating current only. 


Our second example (figure 12-9) uses a different approach. Similar 
to the 566 oscillator in figure 11-1, a capacitor (C1) is charged and 
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Fig. 12-9: Voltage-controlled oscillator with a Schmitt Trigger. 
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discharged by a current. But using two comparators and a flip-flop would 
be too slow for high-frequency operation. So we employ a Schmitt 
Trigger, which has fewer devices in sequence and thus reduced delay. 

There are two thresholds again. The lower one is 1/2*Vdd and is set 
by the two equal resistors R5 and R6. When this lower threshold is 
reached, M21 and M25 turn on. This connects R4 in parallel to R5, which 
makes the upper threshold 2/3*Vdd. Notice that the resulting triangle 
waveform at the capacitor has an amplitude of only 1/6*Vdd peak-to-peak; 
we are trading accuracy for speed. 

This is a somewhat improved version of a Schmitt Trigger; the most 
important factor determining accuracy is the "ON" resistance of M25. If it 
amounts to a substantial part of the value of R4, the effective resistance will 
be higher and will have a different temperature coefficient than R5 and R6. 
To make this "on" resistance small, we should increase the gate width of 
M25, but we can only do this at the expense of speed. The dimensions 
chosen for M25 are a compromise. 

There is a separate stage (M26) to create a rail-to-rail swing and an 
inverter (M28, M29) to make both phases available to the phase detector. 

The rail-to-rail output of the Schmitt Trigger is also used to switch 
the capacitor current between charge and discharge (M18, M19). Again, the 
dimensions chosen here are a compromise. For accuracy over a wide control 
current range we want them to be large; to get a fast response, they need to 
be small. 

There is a voltage to current converter (M1-M7) and R3 is intended 
to be an external resistor. The control voltage is derived from Vdd through 
a resistor divider (R1, R2, R7) with a rest value of 1 Volt, so that the current 
tracks the two thresholds and the frequency is independent of supply 
voltage. With no signal at the input, the voltage to current converter 
produces 100uA. A large-value resistor can be inserted between the two 
input terminals and the base of M1 modulated with the error signal (thus 
changing the current by perhaps +10uA or +20uA. To adapt the phase 
detector shown in figure 12-1 to this VCO, an active load can be used, 
converting the differential error signal to a single-ended one and then 
bringing it to near ground potential with a current mirror. 

With Cl=2pF the frequency of oscillation is 36MHz, with a 
temperature coefficient of -370ppm/°C. At 60MHz (C1=1pF) the 
temperature coefficient increases to -680ppm/°C because of the greater role 
played by delay. Below 20MHz the temperature coefficient is close to zero. 
There is a + 0.3% change in frequency for a + 10% change in supply 
voltage. 
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The Schmitt Trigger 


Otto H. Schmitt was not a German electronics engineer as one would expect. 
He was born in St. Louis, Missouri in 1913 and studied biophysics and zoology. All 
through his life he built the electronic equipment he needed for his research himself and 
became an expert in electronics as well. In 1934 (at age 21) he was engaged in the 
study of the nervous system and came up with a bistable circuit which mimicked the 
behavior of a nerve (using vacuum tubes, of course). He never patented it. 

The schematic below is a translation of his circuit into MOS devices. With the 
input low, the gate of M2 is biased high through R1, thus M2 is turned on. The ratio of 
R2 to R3 sets a bias point at the sources of the two transistors. When the input is moved 
above this point (plus the threshold voltage of M1), M1 turns on, M2 turns off and the 
bias voltage is set to a lower level by the ratio of R1 to R3. 

Otto Schmitt died in 1998, after a long and productive tenure at the University 
of Minnesota. Although he is best-known for his "Schmitt Trigger", it represents only a 
minute fraction of his contributions to science and engineering. 


Output 


Time/uSecs 2uSecs/div 
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13 Filters 


We can go back as far as 100 years and find elaborate electronic 
filters, using inductors, capacitors and resistors. And the inductor in these 
combinations has always been the problem child, the largest, heaviest, most 
expensive and least reliable component. With the advent of integrated 
circuits its status moved from undesirable to virtually impossible. 

There is an intriguing relationship between the inductor and the 
capacitor; they are direct opposites. As you charge an inductor, the voltage 
appears first, the current follows later; in a capacitor the current must flow 
before the voltage can build up. If we build a circuit which shifts the phase 
180 degrees, a capacitor has all the appearances of an inductor. It is on this 
phenomenon that IC filters are based. 


Active Filters: Low-Pass 


R1 Consider a simple RC network. It has a 


15.9k C1 cutoff frequency (the point at which the amplitude 
T drops by 3dB) of: 
n 


1 


Saas = mRC 
Fig. 13-1: Single- 
pole RC low-pass For the 
filter. 
values 
shown in 


b 


figure 13-1 this amounts to 
10kHz. Below about 1kHz there 
is no attenuation. At 10kHz the 
signal at the output is down by 
3dB and at 100kHz (10 times f,) 14 
the attenuation amounts to 20dB. I 
If you extend the straight portion a 


Attenuation / dB 


of the curve (figure 13-2) upward {00 200 400. 1k 2k 4k 10k 20k 40k 100k 
it points precisely at 10kHz. Frequency / Hertz 
Such a single-pole, passive RC Fig. 13-2: Frequency response of a 


single-pole filter. 
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low-pass filter is said to have an attenuation of 20dB per decade or 6dB per 
octave (doubling of the frequency). 

We don't need to be a as 
satisfied with just one RC an PH 188k Tce > 
network, we can connect i 


several of them in series (i.e. sl aie 


cascade them). But we need 

to put a buffer in between the 
stages, otherwise the network 
following will load down the 

previous one too much. 

But look at what we have 
done (figure 13-4). A second stage 
will roll off faster, but it also 
lowers the -3dB frequency. The 4 
more stages there are in series, the A 5 
lower the cutoff frequency. With 4 
16 stages it has moved down to a 4 
little over 2kHz. 100 200 400. 1k 2k 4k 10k 20k 40k 100k 

We can do much better than Frequency / Hertz 
this on fronts. First, th 
ee ae ae ae Fig. 13-4: Placing identical low-pass 

stages in series lowers the cutoff 
will by choosing different resistor frequency. 
and capacitor values for each stage. 
There are schemes for this, worked out mathematically a long time ago, in 
the era of passive filters. They carry such names as Butterworth, Bessel and 
Chebyshev. 

Second, we can take advantage of active components (such as op- 
amps) and create more compact filter stages. There are many such designs, 
with names such as Sallen & Key, Multiple Feedback, Fliege, Bach, KHN, 
or Tow-Thomas. 


Fig. 13-3: A poor way to sharpen filter response. 


dB 


Figure 13-5 shows a second-order 
active filter, using a design developed by 
An | a, + R.P. Sallen and E.L. Key. Only one op-amp 
is required for two poles. 
C3 The component values are chosen to 
75p give a Butterworth response; with different 
= values we could change the frequency 
Fig. 13-5: Second order Sallen response to a Bessel or Chebyshev function. 
& Key Butterworth filter. As you can see from figure 13-6, the 


drop-off is now twice as steep as that of a 
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single RC network (i.e. 40dB per 
decade or 12dB per octave) and 

the -3dB point has remained at a 
10kHz. 


Attenuation / db 


Let's now take a look at 
three filters. The nominal 
designs are identical; they all 


have two cascaded second-order 100 200 400. 1k 2k 4k 10k 20k 40k 100k 
Sallen & Key stages. But each Frequency / Hertz 
filter has different R and C 
1 Fig. 13-6: Frequency response of the 
vee filter in figure 13-5. 
C4 C6 
|| || 
VT | 
R7 | R6 a Rit | R10 120 
3.6k 8.58k T St 7.82k 21.5k T ei 
in=— Bessel 
Ines C5 
C3 Sallen & Key Chebyshev 
Butterworth 
— = 1 
or Ge A Butterworth 
| 1\5n | 3.3n o 
R19 R18 ce > R3 R2 eo > ce) -2 
5.1k 16.2k ei 3.32k 9.03k ei 
in 2 
ines C1 
C11 Sallen & Key 
alk. Bessel cll 3 
3 
cis Gi 
aoe | _ 10n _ | ae 47n a 1k 2k 4k 10k. 20k 40k 
9.76k 34.1k T i 8.83k 27.5k T ei Frequency / Hertz 
ins 
82p —— C16 j 
c14 Salon & Key Fig. 13-8: Frequency responses of 
ebyshev 


; : the three low-pass filters. 
Fig. 13-7: Three fourth-order low-pass filters. 
The different component values result in 
different frequency responses. 


Judging by the frequency 
response alone, the Chebyshev filter has the sharpest response, though it 
produces some ripples in the pass-band (i.e. below 10kHz). This ripple can 
be reduced, at the expense of steepness above 10kHz. In even-order 
Chebyshev filters the ripples are above the line (OdB in this case); in odd- 
order ones they are below the line. 

The Bessel filter gives a gentle roll-off with no overshoot in the 
pass-band, and the performance of the Butterworth filter is in between the 
other two. 
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But there is more to the 
performance of a filter than just 
the frequency response. Take 
the phase of the signal, for 
example. It never stays 
constant in any filter with the 
delays caused by the 
capacitors. But there is a 
difference between the three 
filter types. The Bessel filter 
has the smallest phase-shift, the 
Chebyshev the largest. 

The phase response 
influences two more measures 
of filter quality. The first one is 
called Group Delay, shown in 


deg 


Chapter 13: Filters 


Bessel 
LU 


Butterworth 


Chebyshev 


Frequency / Hertz 


Fig. 13-9: Phase response of the three 


filters. 


figure 13-10. Assume that you pass through the filter not just one 
frequency, but several. A delay in the filter causes the phase-relationships 
of the different frequencies to change and distortion results. 
The Bessel filter is by far the 


Chebyshev 


Butterworth 


\ 


T 
Bessel 
1k 2k 4k 


10k 


20k 


Frequency / Hertz 


Fig. 13-10: Group delay of the three filters. 


100usec pulse was applied to the 
input. We expect a rounding of 
the corners at the output but, 
considering that all three filters 
have the same cut-off frequency, 
the Bessel filter does the best job. 


40k 


How do we get the values for 
the resistors and capacitors? If you open up a text-book on filters, you will 


Edition February 2005 


best in this respect, having not only 
the shortest delay but also the most 
constant. The Chebyshev filter is by 
far the wildest. 
Also, we can judge a filter by 
its pulse response. In figure 13-11 a 


43 
t-Chebyshev 
+ 
Bessel Butterworth 
a ‘ 
0. 
> 
0: 
0. 
Input 
0 20 40 60 80 100 120 140 160 180 
Time/uSecs 20uSecs/div 


Fig. 13-11: Pulse response of the three 


13-4 


filters. 
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see elaborate tables giving you coefficients for Butterworth, Bessel and 
Chebyshev functions. This is no longer necessary. There are a multitude of 
programs available on the web (many of them at no cost), which calculate 
these values for you. Search for "active filter software". 


Bessel, Chebyshev and Butterworth 


Friedrich Wilhelm Bessel (1784 to 1846) was a professor of astronomy 
at the University of K6nigsberg in Germany. By measuring the position of 
some 50,000 stars he greatly advanced the state of celestial mechanics and came 
up with the Bessel function, which was found to be also useful in filters. 


Pafnuty Chebyshev (1821 to 1894) taught mathematics at the 
University of St. Petersburg. His major contribution was the theory of prime 
numbers but, similar to Bessel he left behind a function which later turned out 
to be applicable to filters. 


Of Stephen Butterworth we know only that he worked at the British 
Admiralty for almost all his life. In 1930 he published a paper "On the Theory 
of Filters". He died in 1958. 


Let's look at two more | ii 
low-pass filters, using designs A og peo kX a: 


other than Sallen & Key. The ia. “ 
two stages in figure 13-12 use 5 x - 4 
voltage-controlled voltage- vk pik 

sources (VCVS), an approach cee all 
differing from Sallen & Key 


Fig. 13-12: 4th-order low-pass Butterworth filter 


only in that the op-amps have in a voltage-controlled voltage-source design. 
gain. 
ae. ee ae o3 The design 
a os OE a approach for each stage 
‘sk | 75k i 62k | 3k > of figure 13-13 is 
ao lS called Multiple 
a - Feedback. 
= 7 bs = All these 
Fig. 13-13: A 4th-order Multiple Feedback approach. different approaches 


render the same 
frequency and phase response, but they differ in sensitivity, i.e. how much 
component and op-amp parameter variations will influence filter 
performance. A temperature and Monte Carlo analysis reveals the merits. 
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High-Pass Filters 


There is no mystery 
to converting a low-pass Gi ce co | | 
filter into a high-pass one: in 4a Po in 4 
you simply exchange Rt R3 
resistors and capacitors. 


The drop-off now 
occurs toward the low- 


Fig. 13-14: High-pass Sallen & Key filter with 
Butterworth values. 


frequency end, but at the same rate as 
Al that of a low-pass filter, 80dB per 
= decade for a fourth-order filter. 


dB /db 
Bb 


Note that in all of these 


6 drawings, abstract op-amps are used 
6 (inside the symbol is an ideal voltage- 
7 controlled voltage-source). In a 
ik RR TOR BOR OK ok ©=s Practical design you have to consider 
ee ee the power supply. With a single 


supply, you may have to bias the input 
midway between ground and +V. In 
figure 13-14 this is accomplished at 
the low ends of R1 and R3. 


Fig. 13-15: Frequency response of the 
Ath-order high-pass filter of figure 13-14. 


Band-Pass Filters 


Take the R2 

second-order low- ook 

pass filter of figure | i j 

13-5 and convert a fe 4 2° 

one RC network to ao! dra fe 

high-pass. You tn 718K ig oe " 

now have a drop-off 42 

in amplitude at both ae ee ee Le a ee 
high and low Fig. 13-16: Sallen & Key band- —_— Fig. 18-17: Second-order 
frequencies. pass filter. band-pass filter response. 
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Although the arrangement is called a second-order band-pass filter, the 
drop-off rate is only first-order, 20dB per decade, since only one pole is 
active in each frequency segment. We can of course improve this by adding 
more stages, each stage contributing another 20dB per decade drop-off. 
And here there is a bewildering number of schemes available, with names 
such as Wien-Robinson, Deliyannis, Fliege, Twin-T, Mikhael- 
Bhattacharyya, Berka-Herpy and Akerberg-Mossberg. Your filter program 
will tell you which one to choose. 

There is also an additional choice for the frequency response: 
compared to the Chebyshev filter the elliptic (or Cauer) has an even steeper 
initial drop-off, but the attenuation in the stop-band (i.e. outside the pass- 
band) is not flat. 


Fig. 13-18: A fourth-order, twin-T elliptic band-pass filter. 


The filter of figure 13-18 has two Twin-T stages. The first stage is 
a second-order low-pass notch 
configuration, the second stage is 
called a second-order high-pass 
notch filter. The center frequency 
was chosen to be 50kHz, the 
bandwidth 2kHz. Just outside the 


dB /db 


bandwidth the attenuation reaches 2 
a maximum, but then settles down 
to a modest 15dB. . 


42 44 46 48 50 52 54 56 58 60 


Frequency/kHertz 2kHertz/div 


It must be clear to you by 
now that active filters are costly. 
Not only do they require precision 
components, but the values of most 
capacitors and some of the resistors are such that they cannot be integrated. 
A fourth-order low-pass or high-pass filter requires at least eight external 


Fig. 13-19: Response of a fourth-order 
elliptic band-pass filter. 
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components and five pins. For a band-pass filter with only modest 
performance 14 external components and pins are needed. 


Switched Capacitor Filters 


If we charge a capacitor (Cr) by 
closing switch S1 for a brief period of time, rh) 
then open S1 and close S2 for the same t 
amount of time, the potential across the a + 
capacitor is first that of V1, then V2. 2 = | ss e 

One of the handiest formulae to carry _Ler 


in your mind is: 


es Fig. 13-20: Making a resistor 
out of a capacitor by switching 
i.e. the charge in a capacitor (in Coulombs) at a rapid rate. 

is given by either the capacitance times the 

voltage or the current flowing into the capacitor for a certain period of time. 
In the case of figure 13-20, the current flowing between the two terminals 


over one period is 


Let V1-V2) 


=C,*(V1-V2)*f 


clock 


clock 


If we had a resistor between V1 and V2 instead of the switches and 
the capacitor, the current flowing through it would be: 


_ (V1-V2) 
OR 


I 


Thus the equivalent resistance of the switched capacitor is: 


= 1 
CR* f ctock 


Let's look at some numbers. Suppose the switching frequency is 
100kHz and Cp = SpF: 


1 


Ke a 2*10° =2MegOhms 
10°*5*10-~ 
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Thus, with a relatively small capacitor we can create the equivalent 
of a large-value resistor. If we were to implement such a device directly, 
the cost in area would be prohibitive. 

But the area reduction is just the first benefit of switching; there is 
more: if we use this resistor in a filter, the absolute capacitance value 


disappears. 

Shown here is a simple, one- es 
pole low-pass filter. The cutoff @) 
frequency is given by: a ele 

_ + 
] 7 7 
_— S1 S2 

Soap 2em* R*C Alea Ele 

Substituting the equivalent oe =e 
resistance of the switched capacitor we - 7 
get: 

a 7 

f = F ctock % Ce + i 

3dB De TU C i 

If we make the two capacitors Fig.13-21: By using the equivalent 

equal (any value) and switch at a rate bead ie ae alse 
: . in a filter, only itor ratio 

of 100kHz we get a filter with a cutoff and the clock frequency are 

frequency of 15.9kHz. If C is ten important. 


times the size of Cr and the clock 

frequency remains at 100kHz, the cutoff frequency decreases to 1.59kHz. 
Thus, the switched-capacitor filter has two significant advantages 

over the active (linear) one: 


1. A low cutoff frequency can be achieved with capacitor values small 
enough to allow integration. 


2. The cutoff frequency is not influenced by absolute variations. Given an 
accurate clock frequency and capacitor ratios of 1%, the cutoff frequency 
will be within 1%. 


The simple low-pass filter can be expanded into any of the 


configuration discussed under active filters. Take for example the Sallen & 
Key filters in figure 13-7. In a switched-capacitor design you would first 
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greatly reduce the values of the capacitors and then replace the resistors 
with a capacitor and switches. 

The switched-capacitor filter requires lateral switches, which are 
easily implemented in CMOS, but cumbersome (and slow) in a bipolar 
process. For this reason, this approach has become exclusively CMOS 
territory. 

To minimize the influence of stray Pht 
capacitances four (CMOS) switches are 
often used instead of two, resulting in an 
inverting configuration. Ph2 fe 


CR 
| | 
VI 


Ph2 
Ph1 
For either switch design it is 
important that the two lateral switches - 
never be closed at the same time, i.e. there Fig. 13-22: Switch 
must be some "dead-time" between the configuration to minimize 
the effect of stray 
two phases of the clock. capacitances in CMOS. 
There are four disadvantages with 


switched-capacitor filters: 


1. No matter how carefully you design the switches, there is always some 
switching noise. 


2. A switched-capacitor filter samples the signal. To get an adequate 
sample, the highest signal frequency cannot exceed about 10% of the clock 
frequency. If there are signals present above that point, the switched- 
capacitor filter will produce a mixture of new frequencies, some of which 
may appear in the 0 to 10% frequency range. To avoid such false signals, a 
linear (active) filter must be used at the input (an anti-aliasing filter). 


3. With an ordinary simulator, switched-capacitor filter can only be 
analyzed in real time; you cannot take advantage of the many features of an 
AC analysis, such as measuring frequency and phase response. And with 
the clock frequency necessarily being high, simulation takes far more time 
compared to an active filter. Only if the simulator has additional features, 
such as time delay in the AC model, can it give close to the same picture as 
that offered by linear AC analysis. There are some programs that have been 
designed exclusively for the analysis of switched capacitor filter. 


4. The output has sampled noise, which is present even if the input is zero. 
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14 Power 


Linear Regulators 


Let's say you have 12 Volts available but need 3.3. Your 3.3-Volt 
load consumes up to 500mA. The 12-Volt source (e.g. a car battery) 
fluctuates between 10 and 14 Volts; the lower voltage needs to be within 
5%. 

The immediate choice to effect this change in voltage is a linear 
regulator. Look at it as a variable resistor, dropping whatever voltage is not 
needed. 


The unwanted 


7 + = 7 Vec 
. S hay voltage is dropped in 
»—_#—{a012 an NPN transistor. In 
a2 epi a7 1 a4 Q13 figure 14-1 this isa 
“]10p a4 de Darlington 

el ¢——_—4+—Karo R3 configuration to 

1 < 26.25k soe : 
; ; minimize the drive 


a RA current; it requires at 
Q16 15k 
as) il i ce least 2.2 Volts 
Re rt difference between 


Vcc and Vreg, but it 
is an easy and simple 
design. 

The regulator 
uses a 1.2-Volt 
bandgap reference 
(see chapter 7), whose voltage is compared with a fraction of the regulated 
output by the differential amplifier Q5, Q6, Q7 and Q1O0. Once the circuit is 
in balance the voltages at the bases of Q5 and Q6 are equal, so the regulated 
voltage is: 


Fig. 14-1: Linear regulator with NPN power stage. 


_ Vref *(R3+ R4) 


Vre 
2 R4 
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An operating current is set up by Q1 to Q4 (a circuit derived from 
figure 5-4) and mirrored by Q9. At this point we have about 150uA and the 
current has a deliberate negative temperature coefficient (R2, which creates 
this current, is connected across a VBE, which itself has a negative tempco). 
This counteracts the positive tempco of hFE. 

Q10 shunts to ground whatever operating current is not needed by 
the output stage. 

Using a Darlington 
configuration for the output 
greatly reduces the required 
operating current, but there must 
always be a substantial voltage 
drop between supply and output. 
For this reason such a circuit is 
anything but a low-dropout 
regulator. For our application, a 
conversion from 10 Volts min. to 


Output Voltage / V 


3.3 Volts this is of little concern. 7 2 4 § § 16 72 

The current that flows Supply VoltageV 2vidiv 
through the load also flows Fig. 14-2: Drop-out voltage of NPN 
through the output transistor. So, at regulator. 


500mA, the load consumes 1.65 
Watts, the regulator 4.36 Watts 
(with 12-Volts in), which is 
simply converted into heat. 
Output Transistot This the main disadvantage of 
a linear regulator. The heat is 
produced mainly by one 
device: Q13. Thus there will 
be a hot-spot on the chip and 
resulting temperature 
gradients, even with an 
adequate heat-sink. These 
Supply Voltage/V 2vidv temperature gradients are 
Fig. 14-3: Ina linear regulator the energy not bound to influence other 
required by the load is converted into heat. circuitry on the chip, including 
the regulator's own reference. 
A linear regulator with an NPN output transistor is relatively easy to 
compensate. Despite the fact that the loop gain is high (which results in an 
output impedance of a mere 4mOhm) the circuit is rendered stable with a 


2.5 


Power Dissipation / W 


0 2 4 6 8 10 12 
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single 1OpF compensation capacitor. This stability holds even if a filter 
capacitor (of any size) is added at the output. 


Output Voltage / V 
wo 


6 z 8 9 


Time/pSecs 1pSecs/div 


Fig. 14-4: The regulator is stable, even 
with a filter capacitor at the output. 

Because of the low output 
impedance it takes a massive 
capacitor at the output to have an 
effect of power supply rejection 
(figure 14-5). 

There are three transistors 
and a resistor in this design which 
we have not discussed yet. The 
differential pair Q14/Q15 
compares the reference voltage 
(which is assumed to have a very 
small temperature coefficient) with 
a voltage slightly higher than 2 
VBE (which has a strong negative 
tempco). At temperatures below 
about 120°C the voltage at the base 
of Q14 is higher than Vref and Q15 


Power Supply Rejection / db 


Output Voltage / V 


No Output Capacitance 


=] 
100uF 


1k 2k 4k 10k 20k 40k 100k 400k 1M 2M 4M 10M 
Frequency / Hertz 

Fig. 14-5: Power supply rejection 

with and without a filter capacitor. 


20 40 60 80 


100 120 


Temperature/Centigrade 20Centigrade/div 


Fig. 14-6: Temperature shut-down. 


is cut off. But at about 140°C these two voltages become equal and Q15 
diverts the operating current for the output stage. Thus when the chip gets 
too hot, the output collapses and the source of the heat disappears. This 
makes the regulator virtually indestructible. As the Monte Carlo analysis 
indicates (figure 14-6) the accuracy of the shut-down point is + 10°C. 
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Low Drop-Out Regulators 


Chapter 14: Power 


To get a lower 
minimum voltage drop 
we need to replace the 
NPN Darlington 
transistor at the output 
with a PNP (or P- 
Channel) device. And 
here is where the problem 
starts. 

Output transistors 
need to be large to carry 


300u 


the current and thus have . : 
substantial capacitance, 
multiplied by the Miller 
effect. This forms an 


Fig. 14-7: Low drop-out regulator with internal (lateral) 


PNP transistor at the output. 


additional pole, which gives the regulator a stubborn tendency to oscillate. 


y2 v1 
160 16 Phase 
140 14 
120 12 
3 
© 100 10 
D 2 
Sg < a 
3 g 
S 60 6 
a 
40 4 
20 a 
0 
“20 205 70. 100. 1k 10k 100k iM 


Frequency / Hertz 


Fig. 14-8: Phase/gain diagram using three 
compensation capacitors. 


offers the benefit of increased power 
supply rejection, but its effectiveness 
is limited by the series resistor 
(which is essential to form the zero 
and turn the phase back up). At 
frequencies above about 5kHz the 
power supply noise appearing at the 
output is simply determined by the 
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It takes three capacitors to 
quiet down this regulator. Cl 
provides the main pole at about 
30Hz. C2 corrects the phase at 
very high frequency and Cext, 
together with Rext form a zero at 
1kHz. In addition the loop gain is 
reduced with R1 and R2. Even so, 
the phase margin is barely 50 
degrees. 

The external capacitor 


Power Supply Rejection / dB 
PI 


9 


1m 10m 100m 1 10 100 1k 10k 100k 1M 10M 


Frequency / Hertz 


Fig. 14-9: Power supply rejection. 
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collector capacitance of Q10 and Rext. 

Q10 is a large lateral 
PNP transistor with an effective 
emitter length 40 times that of a 
small device, which makes it 
capable of carrying about 20mA. 
At this current the drop-out 
voltage is 300mV and the output 
impedance is 4mOhms. 

Current capability can be 
increased at will, simply by 
making the output transistor 

iid soomvidv larger. Low drop-out regulators 
Fig. 14-10: Drop-out voltage at 20mA. using lateral PNP transistors 
have been built for up to 5 
Amperes, but at such high 
current levels it helps having a special process which provides a higher 
doping level for the emitter. Even so, the output devices take up some 80% 
of the chip area. Be aware that, as the lateral PNP transistor saturates at the 
drop-out voltage, its substrate current becomes very large. 


Voltage 


Vreg /V 
ie) 


2 2.5 3 3.5 4 4.5 


High Currents in an IC 


There are two factors which limit how much current an IC can carry. The 
first is electro-migration. The force of huge numbers of electrons rushing through a 
conductor can become so large that the electrons begin to move atoms, physically 
push them along. For pure aluminum this happens at about 500'000 Amperes/cm”. 
The effect is slow, it may take months, but eventually there will be an area where 
there is no aluminum left. Electro-migration is aggravated by high temperature and 
depends on the composition and grain structure of the metal. 

Half a million Amperes may seem large and safe, but when you consider that 
you are dealing with very thin layers, the limitation becomes real. For example, for a 
thickness of 10'000 Angstroms (10'000A = lum) and a width of lum a current density 
of 500'000A/cm’ is reached with just 5mA. 

The second limitation is resistance. Pure aluminum has a resistivity of 
2.8uQcm. Thus a layer lum thick has a sheet resistance of 28mOhms/square. Make 
this run 100um long and you have a resistance of 2.83Ohms. 

Let's say you want to carry | Ampere over a distance of 1000um on a chip. 
With a thickness of lum, the aluminum stripe would have to be at least 200um wide 
to avoid electro-migration. It then would have a resistance of 140mOhms, i.e. drop 
0.14 Volts. 

And don't forget to check how much current contacts and vias can take in 
your process, as well as the thickness required for bonding wires. 
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t , , rigs Figure 14-11 shows 
wz a CMOS version of figure 
14-7. Also dimensioned 
for 20mA, the P-channel 
output device is smaller 
than the previous lateral 
' 10k PNP transistor. A low 
| | 1 cea dropout voltage is, 

mu io however only present at 
2 " i low current; to get the 
vies | i | bt 20k fet same value at 20mA, M9 

=tu would need to be 20 times 

194 the indicated size, or a total 
_—_- —+ + +  widthof 8000um. 


Fig. 14-11: Low-drop out CMOS regulator. 11 
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The circuit was designed for a ts 
supply voltage of 3 to 3.6V anda 
regulated output of 1.8V. 

Frequency compensation is nearly u 
as difficult as in the previous example. : 
Again an external capacitor with a resistor 
in series is necessary at the output to SE ae ee ee ee 
create a zero and turn the phase up. ~ “_ 


M9-drain / V 
® 


20mA 


Fig. 14-12: Drop-out voltage. 


Y2 Y1 
180 181 4 
160 161 4 
140 14 a 
Phase 2. 
120 121 c 
g 3 6 
8 100 - 10 & 
3 il oc -6I 
= 80 a 8 2 
z 60 6 é 2 
0 a Gain g 7 
& 
20 2 “7 
0 “8 
720 i0 100. 1k 10k 100k 1M 10M 100M 1k 10k 100k 1M 10M 100M 
Frequency / Hertz Frequency / Hertz 
Fig. 14-13: An adequate phase margin is Fig. 14-14: ...but Rext limits power 
achieved with Rext .... supply rejection at high frequency. 
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The last linear 
regulator makes the most 
sense for higher-current 
applications. Using an 
external PNP power 
transistor, it requires an extra 
pin, but it greatly reduces the 
area and power dissipation of 
the IC. Also, depending on 
the external transistor used, 
the drop-out voltage can 
remain low even at high 
current. 

With the chosen 
device for Q6, the maximum 
current is around 500mA. At 


100mA 


Vreg/V 


3 3.2 3.4 3.6 3.8 


Fig. 14-16: Drop-out voltage. 
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Fig. 14-15: Low drop-out regulator with external 
PNP transistor. 


this level the supply voltage can drop to 
within 200mV of the output (3.3V). 

There is, however, a compromise 
in the loop gain, which effects the output 
impedance (33mOhm); in order to achieve 
stability, the gain has to be reduced (R1, 
R2). Also, the same scheme as used in the 
two previous circuits is required: an 
output capacitor with a resistor in series to 
keep the phase from reaching zero before 
the gain does. 


Phase 


Phase / degrees 
es 
s 
Gain / db 


Power Supply Rejection / dB 


Gain 


a 10 100 1k 


Frequency / Hertz 


Fig. 14-17: Phase margin can only be 
kept high by a resistor in series with 


the output capacitor .... 
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10k 100k 1M 10M eal 10 100 1k 10k 100k 1M 10M 


Frequency / Hertz 


Fig. 14-18: ... which again impairs 
power supply rejection at high 
frequency. 
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Switching Regulators 


Assume again that you have a supply voltage of 12 Volts, but you 
need 3.3 Volts. Your load consumes 1 Ampere. 

A linear regulator acts as a resistor which drops the unneeded 8.7 
Volts. In the process it converts 8.7 Watts into heat. 3.3 Watts are used by 
the load; a rather dismal efficiency. 

Enter the switching regulator: instead of creating a resistance 
between input and output, it connects an inductor between the two for short 
periods of time. 

Vee The switch, S1, is driven 
PWM aie by a pulse generator (PWM, or 

Co pulse-width modulator). The 

pulses are rapid, so that the 

= = ee Rlcad inductor value can be small. 

Zs [4m 9, The inductor, together 

with Cl, smoothes out the 

switching pulses. 

When the switch is 
closed, the left node of the 
inductor is at Vcc (assuming the 
switch has no resistance), but when the switch opens, this voltage jumps 
abruptly to a large negative value, created by the energy stored in the 
inductor. It is the purpose of D1 to catch this negative spike so it does no 
harm to the switch and provides a path for the current during the off period. 

Figure 14-20 shows the 
resulting waveform at the output, for 
duty cycles of 10%, 20% and 40%. 
The average output voltage is simply 
proportional to the duty-cycle, but 
there is a noticeable ripple, the 
remains of the switching frequency 
(100kHz). 

There is also an overshoot, 
which becomes more pronounced as 
the duty-cycle increases. This 
undesirable behavior is due to the LC 


Fig. 14-19: Reducing a supply voltage with a 
series switch and inductor. 


Output / V 


filter (L1, C1). Tinhsocn ini 
With ideal components the Fig. 14-20: The resulting waveform at 
voltage conversion is 100% efficient. the output. 
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But when you add some resistance to the switch and inductor and a forward 
voltage drop for the diode, the efficiency drops. For example, with a total 
resistance of just 5|0mQ 
and a diode drop of 
0.3 Volts (a Schottky 
u, diode) the efficiency is 
= ww = = 7. Vreg 0A%. 
32k The circuit of 
* atoaa gure 14-19 is nota 
ance r a + oe regulator; we have to add 
eos feedback to make the 
Triangle ° Cy output voltage immune to 
supply fluctuations. This 
is accomplished by 
amplifying the difference 
between a fraction of the 
output voltage (RI, R2) 
and a reference voltage in an error amplifier. S1, an abstract simulation 
symbol, is now used as both a switch and a comparator (with the on/off 
thresholds set just a few millivolts apart). The output of the low-pass filter 
(R3, C2) following the error amplifier is thus compared with a triangle 
wave (100kHz, 2Vpp). In this way the regulator finds the duty cycle which 
gives the desired output voltage. Such a circuit is generally called a Buck 
Regulator. 
There are a few items to 
consider, which are peculiar to a 
switching regulator: 2 
First, an actual switch is not 
a perfect device; you will have to 
make a painful compromise, 
weighing voltage drop and speed: : 
the lower the voltage drop the 
more current it takes to drive the 
device. For example, a discrete | i i a: tL 
MOS transistor with an "on" eee sees 
resistance of 100mQ at 1 Ampere 
has a total input capacitance of about 
InF. At a switching frequency of 
100kHz you will need to turn the device on and off in less than 5Onsec, 
otherwise the dissipation during switching becomes significant. This means 
the output of the comparator (the driver stage) has to provide 100mA to 


Error Amp. 


Fig. 14-21: "Buck" regulator. 


Probe1-NODE/V 
a 


Fig. 14-22: Output voltage vs time. 
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charge and discharge InF. If you push the switching frequency to 500kHz, 
this current increases to 0.5 Amperes. 

Second, the current level that the switching transistor needs to 
handle is always larger than the average output current. If you use a small 
inductor, the peak current can exceed the average by a factor of three or 
more; with a large inductance this factor is between 1.1 and 1.4. 

Third, the voltage drop (and switching speed) of the diode is just as 
important as that of the switching transistor, their peak currents are roughly 
equal. 

Fourth, the output LC filter (L1, C1) form a pole, which makes 
frequency compensation (R3, C2) more challenging. 


We can step up the voltage 


by using the induced voltage in an ut iad 
inductor. Switch S1 connects the a 
inductor L1 across the power supply | y oe 
(here assumed to be | Volt). The 
current flowing through the inductor ales re 
is given by: aa io 
PWM S20 
sii © 
L = =" = = — 
. . Fig. 14-23: By using inductive charge the 
As soon as the switch is output voltage can be made higher than 
turned off, a positive voltage appears the supply voltage. 


at the anode of the diode, created by 
the stored current. This voltage is averaged by Cl. 


70 Percent 


Percent 


mA 


D1-cathode / V 


1.435 1.44 1.445 1.45 1.455 1.46 
Time/mSecs 1mSecs/div 


Time/mSecs 5uSecs/div 


Fig. 14-24: Output voltage for three Fig. 14-25: Currents through switch and 
different duty cycles. load. 
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The magnitude of the output voltage depends on how long the 
inductor is charged (i.e. what peak current is reached). Thus, by changing 
the duty cycle, the output voltage is altered. Note that in this configuration, 
too, the current the switching device must handle is considerably larger than 
the output current. 

Vee (1 Volt) Add feedback and we 
have a Boost Regulator. As 
| —_ before, the switch symbol 

at represents both the switch 
a ct ¢Rtoad and a comparator (i.e. the 
$1 4 — ¢ [aru switch turns on and off 
we ke I, 3er within a few millivolts of the 
© “Tioon a differential input signal). 
: ie = = ee But be aware that, in this 
configuration, the feedback 
circuitry must have some 
specific characteristics: the 
output of the error amplifier must be constrained so that it stays within the 
amplitude of the triangle wave-form, otherwise the regulator can hang up at 
either zero or full output. 

The frequency compensation . 
network (R3, C2) also provides a 
"soft start", i.e. the output voltage 
builds up gradually, without much of 
an overshoot. 


u 


10u 
ie Output (2.5V) 


S 


Error Amplifier 


Fig. 14-26: Boost switching regulator. 


Output Voltage / V 


The same principle of using 
the inductive "kickback" voltage is 
also used to regulate larger voltages 
(such as a 110V or 220V line input). ee ee 
The inductor becomes a transformer, Fig. 14-27: Soft-start of the boost 
with a secondary winding delivering a regulator. 
lower voltage (isolated from the line). 

Feedback to the switching device is effected through an optical link (an 
LED and a phototransistor) to also provide isolation. 

Some of the devices in such a line regulator, including the switching 
transistor, need to operate at high voltage; you need to be aware that this 
increases device size considerably (see panel). 
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The Voltage Penalty 


As we have seen in chapter 1| (figure 1-15), depletion layers take up space. 
The higher the operating voltage, the wider the depletion layer. Thus the diffusions 
not only need to be deeper, but also more widely spaced. 

Just how serious is the penalty of using large voltages in an IC? Take a look 
at the drawing below. It compares the required areas for minimum-geometry bipolar 
transistors operating at 5, 20, 40 and 100 Volts: 


Max. Voltage Dimensions,um_ Area Ratio 


152x220 


If only a small portion of the circuitry is required to withstand a high 
voltage, you wouldn't want all of the devices to pay the price of large dimensions. 
This then calls for a more complex process, one capable of producing both shallow 
and narrow devices and deep and wide ones. 


Linear Power Amplifiers 


An ordinary amplification stage (e.g. figure 8-1) is categorized as 
Class A. There is a steady DC current through the transistor and, in the 
extreme, this current can be varied between zero and twice the idle value. 
The power efficiency of such a stage is dismal: it can only reach 50% at 
maximum output; with smaller signals it is much lower. In ordinary 
amplification we usually don't care about efficiency, but when it comes to a 
power output stage, class A is ill-suited. 

In a Class B amplifier two output devices are used, one for the 
positive-going signal and one for the negative half. There is no idle current, 
each device starts to conduct as soon as the signal crosses the zero 
threshold. 
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This is an idealized concept which does not really work in practice. 
It is very difficult to switch from one device to the other without either 
leaving a gap or having both devices conduct at the same time. The result is 
distortion, which becomes very noticeable at low signal levels. 

The solution is a compromise: allow a small idle current so that the 
amplifier works in a class A mode with small signals and gradually moves 
to class B as the signal increases. This operation is called Class AB. 
ay Such an amplifier is 

shown in figure 14-29. 

The two output devices are 

Q10 and Q14. They are 
ato large, having an effective 
200 emitter length some 200 

times that of a minimum 
speaker  eOMetry transistor. 
| speaker Ideally we would 
: want one of the two output 

devices to be a PNP 
transistor, to exploit the 
200 complementary nature of 
the "push-pull" output. But 
NPN transistors carry a 
. — ae much higher current than 
Fig. 14-29: 5-Watt bipolar class AB amplifier. PNP ones (unless a 
complementary process is 
available); with a 5.8 Watt output capability (requiring peak currents of 
1.2A) this is no minor consideration. 

To deliver the high output current, the upper stage (Q8, Q10) uses a 
Darlington configuration. Q9 serves to by-pass leakage current at high 
temperature. 

The lower output stage has the identical Darlington connection plus 
a PNP transistor. The entire four-transistor block behaves like a PNP 
transistor. (All PNP transistors in this circuit are fairly large, capable of 
carrying 3mA). 

There are three base-emitter junctions between the base of Q8 and 
the base of Q11. Between these two nodes a voltage is provided which 
causes a few hundred microamperes of idle current to flow through the two 
output transistors. This is done with the current I2 and transistors Q6 and 
Q7. The VBE of Q6 is increased with the resistor divider R5/R6 to the 
point where the desired current is reached. Notice that Q6 tracks the VBEs 
of Q8 and Q10 and Q7 tracks that of Q11. 


Input 
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The feedback resistors R1/R2 
set the gain at 30dB and C1 provides 


frequency compensation. The slowest 


device in the amplifier is the 


compound PNP transistor Q11 to Q14, 


but it is fast enough to allow a more 


than sufficient frequency response for 


an audio amplifier without creating 
stability problems. 

One significant drawback of 
using only NPN power devices is 
voltage drop. Only +10 Volts are 
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Gain/dB 


1k 10k 100k 1M 10M 


Frequency / Hertz 


Fig. 14-30: Frequency response of 
the class AB amplifier. 


available at the output from the +12 Volt 


1 


100 


Spectrum(Q12-C) / V 


Frequency/kHertz 500Hertz/div 


Fig. 14-31: Spectrum of the output signal 
at full power. 


It is often argued that, in 
audio applications, peak power is 
rarely required and so the heat sink 
for the amplifier can be reduced in 
size. Unfortunately, in a class B (or 
AB) amplifier, peak dissipation 
occurs not at peak output, but at 
about 50% of maximum power. 


The design of figure 14-29 
requires a split power supply. There 


power supply without creating 
distortion. At 10Vp, however, the 
distortion amounts to only 0.15%. 

The maximum efficiency of 
an ideal Class B amplifier is 76%. 
For this circuit, with its 2-Volt drop 
in each output device, the maximum 
efficiency amounts to 62%. Thus 
the output transistors produce 1.7 
Watts of heat each (for a 5.6 Watt 
output). 


Figure 14-32: Power dissipation vs. 
power output in a class B amplifier. 


are two ways to avoid this. We could convert the -12V connection to 
ground, make Vcc 24 Volts, bias the input at 1/2 Vcc and couple the 
speaker through a capacitor. The only problem with this approach is the 
size of the new capacitor: 2000uF to get a 3dB drop-off at 10Hz. 
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Fig. 14-33: Class AB amplifier with 


bridge output. 


A better solution is the Bridge 
Output. In essence there are two 
amplifiers, 180 degrees out of phase. With 
no input signal, both output rest at 1/2 Vcc. 
As the signal appears, one output moves up, 
the other one down. 

In this configuration we have in fact 
doubled the output swing. With the same 
total supply voltage, 25 Watts of output are 
generated (which requires four output 
transistors with a capability of 2.5A each). 
Efficiency is unchanged at 62%, which 
produces a power dissipation of 15.3 Watts. 


Switching Power Amplifiers 


+10V 


Oo 


ty S1 
LA 
- ™ Speaker 
30u 
SquareWave RlLoad 
8 
+ $2 


-10V 


Fig. 14-34: Bidirectional 
switching arrangement. 


To start with, let's use 


two power supplies. The two 
switches connect the inductor 
to either the positive or 
negative rail. For now we 
assume that there is no dead 
time or overlap and that this 
switching action is instantaneous. 
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The goal is almost the same as that of 
the series switching regulator: lower a 
voltage across a load without creating much 
heat. There are two differences though: the 
output starts at zero and it can move in either 
the positive or negative direction. 


Time/uSecs 5yuSecs/div 


Fig. 14-35: Switching and output wave- 
forms. 
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The value of the inductor is fairly large for the chosen switching 
frequency (200kHz); it is never fully charged or fully discharged. Despite 
this, there is still a substantial ripple at the output. 


Inductor Current / mA 


0 5 10 15 20 25 


Time/uSecs 


Fig. 14-36: Current through S1. 


Fig. 14-37: Class D amplifier. 


now) the switches are ideal, they have no 


delay and insignificant resistance. 
Also, a filter capacitor (C1) has been 


30 


5uSecs/div 


added; this reduces the 200kHz ripple 


but increases the build-up delay 
mentioned above. 


The output is now a sine-wave 


with a small amount of 200kHz ripple. 
Since we use near-perfect components, 


the distortion is very small. 
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The average output voltage 
is a function of the duty cycle. At 
50% the output is zero; 75% 
produces +5 Volts and 100% +10 
Volts. Duty cycles of less than 
50% cause the output to be 
negative. 

Notice that the current 
gradually builds up (figure 14-36); 
the time constant of this effect is 
given by LI] and the 8-Ohm load (a 
speaker), a factor which will 
become important when we close 
the loop with feedback. 

Let's now take the next step 
and modulate the duty cycle with a 
sine-wave signal, making a Class 
D amplifier. As in the switching 
regulators, the switch symbols also 
act as comparators (1.e. the 
thresholds of the control terminals 
are set so that the switches turn 
from off to on (and from on to off) 
within a few millivolts. Also (for 


Output Voltage / V 


= 12 14 16 78 2 


Fig. 14-38: Output wave-form. 
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Output Spectrum / V 
Output Spectrum / V 


Frequency/kHertz 500Hertz/div 


Fig. 14-39: Frequency spectrum in the 
signal range. 


+10V 


Input Speaker 


1 270n 
y Ci 


RLoad 
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$2 


Triangle 


D2 


-10V 


Fig. 14-41: Pulse-width modulated circuit with 
more practical component models. The 
diodes are now required to absorb the voltage 
spikes. 


a P-channel one). In addition there is 
a small dead-time to avoid both 
devices being "on" at the same time. 
This dead-time creates a voltage 
spike from the inductor, which 
makes D1 and D2 necessary. 

These small imperfections 
have a significant impact on the 
fidelity of the output signal: 
distortion increases to 1%. 

Unless we use faster 
switching transistors with lower 
voltage drops and better matching, the 


Spectrum(Probe1-NODE) / V 
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150 200 250 300 350 400 450 


Frequency/kHertz 50kHertz/div 


Fig. 14-40: Frequency spectrum in the 


switching range. 


Alas, if we only had 
ideal components. In reality 
the switches have resistance 
and significant switching 
times. In addition, as pointed 
out on page 14-9, they require 
painfully large drive power. 

In figure 14-41 the 
models are changed to 
represent more practical 
components. The switch 
resistances, for example, result 
in larger and unequal voltage 
drops (200mV for an N- 
channel transistor, 300mV for 


Frequency/kHertz 500Hertz/div 


Fig. 14-42: Signal spectrum with 
realistic components. 
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level of distortion can only be brought down with feedback. And that is 
somewhat of a problem. 

In order to reduce the 
high-frequency components at 
the output we used an LC filter. 
It is dimensioned to be effective 
at 200kHz, but it causes a phase 
shift already in the audio range. 
Amplitude Because of this, the amount of 
feedback possible, using a single 
feedback loop, is limited to 
about 20dB. With two or even 

ik 2k ak 70K 20K 0K TOK three nested feedback loops this 
i figure increases to about 35dB. 
Fig. 14-43: Amplitude and phase response of Also, a loud-speaker rp 
the output filter. not really a simple resistor, there 
is some inductance as well, 
making the phase relationship in the feedback path even more complicated. 

We could, of course, increase the switching frequency, which would 
allow us to push the cutoff frequency of L1 and C1 higher, but the penalty 
would be lower efficiency and an increase in drive requirements for the 
switching transistors. 

We have been assuming that we want a faithful (albeit larger) 
reproduction of the input signal at the load. Strictly speaking, this is not 
really true. In the case of an audio amplifier, the human ear cannot hear 
200kHz, so filtering out high frequencies makes little difference. If the 
application is a servo amplifier, the load is unlikely to respond to such rapid 
fluctuations. 

But there is radiation. Do we want to connect a square-wave of 
200kHz (and its harmonics) across a long speaker cable and let it radiate 
into AM receivers and other electronic equipment? The answer is a clear 
no, and rules and regulations limiting such radiation have been written. 

There are ways to reduce radiation. First, we can keep the speaker 
wires short, moving the amplifier next to the speaker. Second we can vary 
the switching frequency in a random fashion, creating a spread spectrum. 
Although this does not reduce the total radiation, it at least makes it less 
noticeable and allows meeting radiation limits. 


Y¥2 Y1 


Phase / degrees 
roy 
ro) 
Amplitude / db 


For a given supply voltage and speaker impedance, the delivered 
power can be increased by using a bridge output. In figure 14-44 there are 
four switches. S1 and S4 are always "on" and "off" together, as are S2 and 
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S3. Thus the load is 


either connected to +V on 
the left side and -V on the 
right, or vice versa. This 
effectively doubles the 
supply voltage and the 
amplifier can deliver 25 


Watts into an 8-Ohm 


load. There are four large 


output transistors, 


however, each of which 


must carry up to 2.5 
Amperes. 


If we apply 40 
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D3 


-10V 
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Fig. 14-44: Class D amplifier with bridge output. 


Volts total and use a 4- 
Ohm speaker, the output 
power grows to 196 Watts (and the peak current in the four output devices 


to 10 Amperes). 


+ 


OV 


D1 


Input Output 


$3 


Fa 
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-10V 


Imverter 


Triangle 


ety 


Fig. 14-45: Class D amplifier which suppresses the 
fundamental of the switching frequency. 


But now 
let's change the 
circuit a little. 
Instead of having 
the two outputs 
move in opposite 
direction, invert one 
of the drives so that 
they move up an 
down together. If 
the input signal is 
zero, the two 
outputs will move 
at exactly the same 
time. Each output 
then carries a 
200kHz square- 


wave, but between them there is no signal. As the input signal goes 
positive, the duty-cycle of one output increases while the duty-cycle of the 
other output decreases by the same amount. Thus, between the two outputs, 
there is now a square-wave with a duty cycle amounting to the difference. 
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Frequency/kHertz 


100 150 200 250 300 350 400 450 500 


50kHertz/div 


Fig. 14-46: Radiation spectrum across the load in 


figure 14-45. 
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The effect on the 
frequency spectrum is quite 
drastic: the fundamental of the 
switching frequency has 
disappeared; we only need to 
worry about the second 
harmonic, which has a lower 
amplitude and is easier to filter 
out. 

But let's not get too 
enthusiastic here: the 
fundamental of the switching 
frequency is no longer present 
when measured across the 
load, but the wires leading to 
the load move up and down 
together, at the rate of the 


switching frequency. While this movement causes no current to flow 
through the load, there is still capacitive radiation from the wires. Hence 
Cl and C2 are needed. 


A last word about class D amplifiers: simulation is very difficult 


and time-consuming. Unless you have a highly specialized program, only 
transient analysis can be used, which means you cannot obtain such 
parameters as phase margin directly. You may be forced to simulate (and 
integrate) the various blocks in pieces and then resort to old-fashioned 
breadboarding. 
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Chapter 15: Ato D andDtoA 


AtoDandDtoA 


The field of data converters is vast and still expanding. It would be 
presumptuous to cover all of it in one chapter. For this reason only some of 
the most often used approaches are discussed here. 


Digital to Analog Converters 


There is nothing very mysterious about most digital to analog 
converters. Just take a look at the first figure. A string of identical resistors 
divides a reference voltage into eight equal parts. Of eight MOS transistors 
only one is on at a time, connecting the selected tap to the output. 


Vref 


T Analog Out 


| 


Decoder 


Fig. 15-1: 3-bit Divider DAC. 
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Of course this is a very simple 
example, a grand-total of three bits. It 
gets a bit more complicated as you 
increase the number of bits; 256 
resistors and transistors are required for 
8 bits, 1024 for 10 and 4096 for 12. 

Also, this DAC creates analog 
voltages of only one polarity; analog 
signals have the nasty habit of being 
bipolar. To include negative-going 
values we need to double the number of 
resistors and add an identical negative 
reference voltage. Thus an 8-bit Divider 
DAC requires 512 resistor segments. 
For higher number of bits (and the 
expected higher accuracy) the matching 
of the resistors becomes the limiting 
factor; not only does the number of 
resistors increase but each resistor needs 
to be larger to obtain better matching. 
And forget trimming: at 12 bits you 
would have to trim each of the 8192 
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resistors. 

Note that the full reference voltage does not get to the output. This 
quirk is caused by the fact that a string of eight resistors has nine nodes. 
We need to include zero, so three bits only reaches 7/8 of the total voltage. 

To represent bipolar values some special codes are used. In the 
"sign + magnitude" code a bit is added which represents just the polarity. 
This is not the most efficient way and it is somewhat awkward (there are 
two values for zero, 0000 and 1000). 


The offset binary code sae OTsst Twos 
simply starts at the most Magnitude Complement 
negative number and counts up. TERE TEER 
Note that there is only one 0110 0110 
value for zero, but the full 3 : a4 
reference voltage is still not 001 : 
present. 0001 

In the twos complement ° 0000 
code, positive numbers are the 7) 
same as in a binary code, with 3 
the additional sign bit. 5 
Negative numbers are the $ 
inverse or complement of the 8 
positive ones, with a 1 added Fig. 15-2: Codes representing bipolar 
and the sign-bit changed. values. 


Other codes are also 
used for DACs. In the BCD (binary coded decimal) code each decimal 
digit is represented by four binary digits. BCD is primarily used for digital 
voltmeters. The Gray Code changes only one bit at a time, a feature useful 
in shaft encoders. 

In any DAC, the output is strictly proportional to the reference 
voltage; double it and the output will double. Thus if you treat the reference 
terminal as an input, you have what is known as a multiplying DAC. 


To increase the number of bits without exploding the number of 
resistors we can move to a Segmented DAC. Figure 15-3 shows a simple 
example for six bits, divided into three 2-bit segments. In this way we can 
reduce the number of resistors from 64 to 12; for a full-fledged design 
which delivers positive and negative values you would again need to double 
these numbers. 

The first segment selects a resistor rather than a tap and the 
corresponding voltage drop is buffered and delivered to the second segment, 
where the same process is repeated. In the last segment taps are again 
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delivered to the output. Notice that in this approach, too, the top of R9 is 
not connected; a seventh bit would be required to allow that. 

In such a segmented DAC the resistors in the first string (the most 
significant bits) are the most critical. They should be largest in size to 
obtain the best matching (or be trimmed). 

It is crucial that a DAC be monotonic, i.e. as you step through the 
code from low to high, the output always increases (it may not increase by 
precisely the same amount, but at least it will never decrease). A divider 
DAC is always monotonic. The same holds true for the segmented DAC, 
provided each segment is monotonic (which is the case if we use dividers). 


Vret [+ * P> 
" = FS x= , i 
"s to " + nd * > Analog Out 
1 " 1 | 
R3 ; __— i. _—— | ay ‘ ] 
+ 4 = =a 
“ i= | : ca Lie : a 


Vref 


S| a ee 
> 


= *| 0 o 1 1 0 


Fig. 15-3: 6-bit segmented DAC. 


It is not necessary to use only identical resistors. Figure 15-4 shows 
a 6-bit example of a DAC using binary-weighted resistors. Moving from 
the most significant to the least significant bit, the resistors double in value 
each time, thus no decoding circuitry is required. 

Only one resistor (and transistor) is required per bit but the saving is 
largely an illusion. To get good matching, you would need to design all 
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resistors with 

identical segments wet ya 
(1R), which amounts = Rt 
to 63 resistors, one zi i AA 
less than a simple 
resistor string. In an 
addition, the resistor = 
for the most t ws 1 
significant bit ga | 
influences accuracy —- 
the most, so it should 
be larger than the 
others. Also, the 
transistors carry 
current, so their 
resistances appear in series with the binary weighted resistors. This means 
that M1 should not only be very large, but it should be 32 times the size of 
M6. 


Analog Output 


M3. 


Fig. 15-4: DAC with binary weighted resistors. 


A somewhat better idea is the R-2R (Ladder) DAC. Just two 
resistor values are used and the bit lines need not be decoded. 

With the most-significant bit high, the first 2R resistor is connected 
between Vref and the input of the op-amp; all other 2R resistors are 
connected to ground. Each subsequent bit has half the influence on the 
output as the previous one. 

Veet — — — This is an 
ie _ - - e inverting 
amplifier, so the 
‘ output goes 

AL aL aL cL pL eh dh fl negative with a 


positive reference 
tf>e t{>° t}>° te Fc voltage. 

=e Using 

MSB (3B only two resistor 
values improves 
matching and 
trimming is 
easier. Note, however, that the MOS transistors carry current and their 
resistance is critical. Moving from left to right, the current drops by 50% in 
each stage. Thus, to get the smallest error, the transistor size should be 
doubled for each stage moving from right to left. 


Fig. 15-5: DAC with R-2R ladder. 
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In many DACs the analog voltage is created not by voltage taps but 
by currents. An example of a much simplified current DAC is shown in 
figure 15-6. A primary current is generated by R10 from a reference 
voltage; with Vref at 1 Volts, this current amounts to 200uA. QI through 
Q6, being biased from Q1, each produce a fraction of this current, Q2 


Vref 


R11 
10k 


) 


+V 


Analog Out 


n 


witch 


R3 


RS 


R7 


1k 


Fig. 15-6: Bipolar current DAC with R-2R ladder. 


hungry approach. 


1k 


1k 


100uA, Q3 50uA, Q4 
25uA and Q5 12.5uA. 
Q6 is used to terminate 
the ladder. 

These binary- 
weighted currents are 
then switched to either 
R11 or +V (by, for 
example, a differential 
pair, acting as logic 
inputs). Note that the 
currents are flowing all 
the time, which makes 
this a fairly power- 


You can, of course use the current directly as the output. But note 
that, if R10 and R11 are both inside the IC, their temperature coefficients 
and absolute variations will cancel. 

Don't let this simplified example mislead you; there are many 
sources of error which require fine attention to detail. In a bipolar circuit 
there are base currents which must be compensated lest they subtract as 
much as 1% from the ideal values of the binary (collector) currents. And 
each collector must be at exactly the same potential as Q1 (here ground). 


Lastly, if bipolar 
switches are used, they, 
too, will have base 
currents which also 
must be rendered 
harmless. 


Although it 
started out that way, a 
current DAC is no 
longer primarily a 


bipolar affair; in fact 
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R3 
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Fig. 15-7: CMOS current DAC with R-2R ladder. 
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CMOS has some significant advantages here. There are no base currents, 
therefore no base current errors. However, the drain voltages still need to 
be at identical levels, otherwise the Early effect will cause substantial 
deviation. 


The number of bits is limited by the matching of the resistors and 
the sizes of the transistors. At eight bits M1 would consist of 512 
transistors the size of M5 and M6. The latter two are already going to be far 
larger than minimum size to ameliorate the Early effect (making the channel 
long) and get acceptable matching (making the channel wide and long). 
512 of these (twice as many for both polarities) make the area painfully 
large. 

There is a way out of this limitation: segmentation. In our example 


Analog Out 


Fig. 15-8: Segmented current DAC. 


the last transistor is only used for terminating the resistor ladder. It carries 
the same current as the least significant bit, 12.5uA. Use it and split it into 
16 equal parts, 8 used for 5th bit, 4 for the 6th, 2 for the 7th and 1 each for 
the 8th bit and a dummy transistor. 

Simple transistor ratios are used for this extension. We could have 
also employed another R-2R resistor ladder but, since these are the least 
significant bits, the accuracy is most likely sufficient. 
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Analog to Digital Converters 


Analog In As in DACs, the divider analog to 
ues digital converter is the fastest and most 


re dei simple approach. Shown here for just three 
: bits, there are eight comparators with one 


a) input connected to a resistor tap and the 


; other to the analog input. Wherever the 

7 analog signal exceeds the potential at the 

a tap, the output of the comparator goes high; 

Ra the comparator outputs are decoded into 

three bits. If the input can be both positive 

= and negative, twice as many resistors and 

> comparators are needed, as well as a second 
> reference voltage with an equal but 

ih negative value. 

' : All comparators operate 

simultaneously, so the speed of the 

i : converter is given by the speed of the 

Re comparators alone. 

— The disadvantage of this approach is 

again complexity with large number of bits, 

with an accompanying high power 

consumption. At eight bits 512 resistors 

and comparators are required (assuming a bipolar input); at 12 bits the 

number increases to 8192. Also note that all comparator inputs are in 

parallel, which makes for a rather large input capacitance. 


Fig. 15-9: Divider ADC. 


The Successive 

Approximation ADC on |_vret 
reduces the number of 

comparators to one, though rates ee 
the number of resistors Te ; = 
remains unchanged (Figure ae | + 
15-10). Here a sample of Encoder 
the analog signal is taken sepsis onan 
and held steady while the 

conversion takes place. Fig. 15-10: Successive approximation ADC. 


The heart of the ADC is a 


Digital Output 
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DAC. The control logic sets the DAC through a register to a likely value 
initially. If the value is too high, the register is moved down; if the initial 
guess is too low, the register is moved up. After a few steps the correct 
setting is found and the conversion stops. 

All this guessing and stepping takes time, thus a successive 
approximation ADC is considerably slower than the divider approach, 
though it consumes less current and takes up a smaller area. 

In both approaches the accuracy is limited by the resistor (or 
capacitor) divider, as was pointed out in the DAC section. 


The Delta-Sigma Converter 


Most engineers find explanations of the sigma-delta converter 
almost incomprehensible. There is a reason for this: terms are used which 
are quite non-descriptive and often misleading. 

Take a look at a 
conventional diagram for a first- 
order delta-sigma ADC (Figure 15- 
11). This circuits has a "one-bit" 
output, which is more a riddle than 
a description Figure 15-12 shows 
the same function with more 
familiar blocks, which makes the 
circuit far easier to understand. 


Fig. 15-11: Conventional diagram for a 
delta-sigma converter. 


1 
7 BS 
Analog In our 10p 
> Comp. 
a . Pulse 
> Train 


$1 t 


Fig. 15-2: First-order delta-sigma ADC with more familiar blocks. 


The object is to produce a train of logic pulses at the output whose 
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frequency is proportional to the (analog) input voltage. Counting the 
number of pulses for a given time interval then gives the equivalent digital 
value, i.e. this is a voltage to frequency converter. 

Let's start on the left. There are two resistors leading to the input of 
an integrator. R2 is connected through a switch to a negative reference 
voltage (of 2 Volts), while R1 responds to the input signal (with a range of 
0 to 1 Volt). If switch S1 is open, the positive input voltage causes the 
output of the integrator to move negative at a rate given by R1 and Cl (this 
is an inverting integrator). 

The following comparator then triggers a timer when this falling 
voltage reaches the bias level (which is set here at zero volts but can be any 
convenient level). The timer produces a short pulse (e.g. lusec) which is 
delivered to the output. 

This pulse also closes the switch. Since the two resistors are equal 
in value (an arbitrary choice) and Vref is at least twice the value of the input 
voltage, the integrator reverses during this time and its output moves up. 

If the input voltage is 1 Volt, the output of the integrator moves 
positive during the pulse by exactly the same amount as it has moved 
negative while the switch was off. Thus the duty cycle is 50% and the 
frequency 500kHz (remember that R1 is connected all the time, while R2 is 
connected only half the time, hence Vref needs to be at twice the level of 
the maximum input signal). 

If we lower the input signal to 100mV, the falling portion of the 
integrator output becomes longer (while the rising portion remains constant) 
and the frequency drops to 50kHz. At 10mV input the frequency is 5kHz 
and at ImV 500Hz. At zero input the oscillation stops entirely. 

As in the examples above, this circuit can only handle positive input 
voltages. For a bipolar input (say + 1 Volt), R2 is switched between two 
reference voltages, +2V and -2V. 

So, the mysterious "1 Bit DAC" turns out to be nothing more than a 
switch and one or two reference voltages, the "Latched Comparator" a timer 
triggered by a comparator and the "Summer" two resistors (it actually is a 
subtractor). The "delta-sigma modulator" (which purists insist should not 
be called a sigma-delta modulator) is simply a voltage to frequency 
converter. And the "1 Bit Output" translates into serial output. 

For accuracy two factors are of overwhelming importance: the 
reference voltage and the pulse-width. Reference voltages can be trimmed 
and the pulse-width can be derived from a crystal-controlled clock. All 
other elements are of secondary importance. R1 and R2 need to match well, 
but their absolute values (and that of C1) only affect the height of the 
triangle wave at the output of the integrator, not the timing (the rising and 
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falling flanks of the triangle wave are equally affected and thus cancel out). 
A high loop gain in the integrator op-amp assures that the voltage 
fluctuation at its input is down to a few microvolts (but its offset voltage 
still matters). 

The performance of a delta-sigma ADC can be improved by adding 
one (or more) feedback 
loops (Figure 15-13, 
shown again in the 
conventional way). Be 
aware, however, that 
the higher the order, the 
less stable the design 
becomes. A third-order 
delta-sigma ADC can 
oscillate in some Fig. 15-13: Second-order Delta-Sigma ADC. 
unexpected ways. 

The significant advantage of the delta-sigma ADC is its capacity for 
resolution. These circuits are often called "oversampling ADCs", because 
they must sample the incoming waveform above the Nyquist rate (twice the 
maximum input frequency). For example, if you want to capture a 1kHz 
signal with an 8-bit resolution, the maximum frequency must be at least 
2kHz times 256, or 512kHz. At 12 bits this frequency increases to 8.2MHz. 

However, this presumes that we do nothing else than counting 
pulses at the output, which ignores much of the concept's capability. The 
delta-sigma ADC was made for CMOS and with today's small geometries a 
great deal of digital signal processing can be done once the pulses exist. 
Apart from increasing the resolution, the sampling noise (which is already 
centered around a rather high clock frequency) can be brought down to 
stunningly low levels with a sophisticated digital low-pass filter (called a 
"decimation" filter). 

With these additional measures, a second-order delta-sigma ADC 
with a signal bandwidth of 4kHz can achieve a 14-bit resolution with 85dB 
signal to noise ratio, using a clock frequency of 1MHz. 
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16 Odds and Ends 


In this, the last circuit design chapter, we look at six functions which 
did not fit well into the previous subjects. 

As pointed out before, be aware that you will need to re-simulate 
these circuits with models specific to the process to be used. 


The Gilbert Cell 


An unusual and brilliant idea: 
Take two current mirrors and connect 
them in a differential way. Run the inner ey JQ" 
pair at a higher current than the diode- 50u a0 
connected transistors and you get gain, ee 
roughly in the ratio of the currents. [2 and 
I3 are modulated, the collector currents of 
Q2 and Q3 are the outputs. | " 

Since this is a current in/current 
out scheme, the cell is fast (no Miller 
effect). 

In the second form of the Gilbert 


Fig. 16-1: The first form of a 
Gilbert cell. 


se! vec cell all three currents 
= flow to the negative 
rail, which allows 
stacking: use the 

J t collector currents as 
ag oH rKo ah the inputs for the next 
a er | cell. Each subsequent 
cell is biased at a 


18. | A \s:. ' . ‘ 3 higher DC potential to 
6 8 is avoid saturating any of 
the transistors. 


In both forms 
Fig. 16-2: Second form of Fig. 16-3: And the third there is a small error 
the Gilbert cell. form of the Gilbert cell. due to the base 
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currents, which is largely eliminated in the third form. 

Alas, all this is true only in an isolated, theoretical analysis. It is 
very rare that you start out with a differential current input. In most 
applications there is an input voltage, and single-ended at that. So, to use 
the Gilbert cell, you need to convert this voltage into a differential current, 


Fig. 16-4: A practical application of 
the Gilbert cell. 


This problem is made 
worse by stacking several cells, 
requiring a wide range of 
current levels. In today's 
power-conscious and low- 
voltage environment the 
Gilbert cell has become 
outdated. 
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for example with a differential pair, as is 
shown in figure 16-4. 

And here is where the Gilbert cell 
falls down. As shown in figure 16-5, a 
differential pair actually has a higher gain 
and wider frequency response (if operated 
at I1) without the Gilbert cell. Which is 
why it is rarely used. This is not due to 
the fact that the current is converted into a 
voltage at the output (and thus the Miller 
effect is right back in the picture), but 
simply because Q1 and Q4 need to run at 
a lower current. 


With Gilbert Cell 


Differential Paif Only 


Differential Gain / dB 


1k 10k 100k 1M 10M 100M 


Frequency / Hertz 


Fig. 16-5: In most applications the Gilbert cell 
does not actually enhance performance. 
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Multipliers 


v2 


al 


SUB 


Fig. 16-6: Simple four-quadrant 
multiplier. 


Such a circuit is 
called a four-quadrant 
multiplier because it 
produces an output for all 
four quadrants of a plot: 1. 
both inputs positive; 2. V1 
positive, V2 negative; 3. both 
inputs negative and 4. V1 
negative, V2 positive (where 
the value of V2 is applied 
with respect to 2.5V), 

The range of the two 
input voltages is + 100mV, 
resulting in a maximum 
output of + 10mV. This limits 
the achievable accuracy since 


Differential Output / mV 
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We have seen a similar circuit 
before, used as a phase detector for a 
PLL. While accuracy in that application 
was of minor importance, in a multiplier 
it is the main feature, 

The circuit requires a split power 
supply (e.g. +5 Volts), so that at least 
one input (V1) can be at ground level. 
The second input (V2) is biased safely 
higher (2.5V) to avoid saturating Q1 
and Q2. 

It is the insertion of resistors in 
the emitters of all six transistors that 
gives this multiplier its accuracy. Their 
values need to be large compared to the 
dynamic emitter resistance (re, see page 
4-1). 


Vi/mV 


20mV/div 


Fig. 16-7: Behavior of the four-quadrant multiplier. 


matching of VBEs becomes as important a factor as matching of the 
resistors. A higher positive supply voltage is needed to allow input and 


output ranges of + | Volt. 


Edition February 2005 


16-3 


All rights reserved 


Camenzind: Designing Analog Chips Chapter 16: Odds and Ends 


As shown, the error can be as high as + 5% untrimmed and + 1% 
trimmed. With a higher supply and trimmed thin-film resistors + 0.3% is 
possible. 


T ee By adding two 
R7 R8 RQ . . . 
10k 20k 220k transistors and biasing the 
» {Jou upper quad (Q7 to Q10) with 
t Out diode-connected transistors 
ee a ae - ob (Q5, Q6, sitting ata DC 
‘ ab i potential set by R7), both 
' | 2 input can be at ground level. 
Also, the ranges of the two 
oR a inputs and the output are 
et = - — now extended to + | Volt. 
- a. + = - . Accuracy is unchanged for 
a aa ae untrimmed operation; with 
‘pou J 1000 Apis cy, S00 trimmed thin-film resistors 
18" 8 a 8 8 , and additional temperature 


compensation (see reference) 
such a circuit can be brought 
to within 0.1%. 


Fig. 16-8: Four-quadrant multiplier with both input 
voltages at ground level. 


Figure 16-9 . +1.5V 
shows the equivalent R7 Re Re 
: o 20k 20k 20k 
circuit in CMOS, 
designed for a 0.35u ines - 
- ut 
process. Because of the Ms| M6 Mz Me Mg Iao 
lower supply voltages a | E | St | | = } 
the range of the input ceegt ae) oe ee 
voltages is again limited | Med et ae Med 
to + 100mV. f (=) 
This circuit 
= M1 M2 M3 M4 
illustrates the 7 | | ws | | 
performance limitations w=20) Wi=20u w=20} Wi=20u 
: L=0.5u =0.5u L=0.5u =0.5u 
imposed by low supply oA veel -4 Maa —L— 
voltages. With offset Rt R2 RS R4 
10k 0k 10k Ok 
voltages generally . He 
being higher in CMOS Yotk | [wow f 
and the maximum i l2 13 \4 
output range a mere + [Bre {Give {Orn ae 
- = 1.5V 


10mV, untrimmed 
accuracy is no better 


than about + 10%. You Fig. 16-9: Four-quadrant CMOS multiplier. 
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can, of course, change the resistor ratios so that the relationship between 
inputs and output is multiplied by a constant. 


Peak Detectors 


Peak detectors tend to be a bit tricky. A surprising number of the 
schemes using an op-amp and a diode tend toward oscillation or other 
misbehavior. 

When you analyze the feedback loop, you find that the diode and the 
capacitor at the output place an unusual burden on the op-amp. When the 
signal moves up (for a positive peak detector) the output is connected to a 
large capacitance. When the signal moves down, there is no load at all. 

The circuit in 
sv figure 16-10 uses a bipolar 
as Darlington pair to provide 
a high-impedance input at 
the (external) capacitor. 
tke For operation over a wide 
Rs temperature range the 
In outer transistors (Q1, Q4) 


10u ] 10u |“ a6 
iN 


aa T Kor 7 T bc, are biased at about 0.8uA 
ot rKo2 4 by Q9 and Q11. 
= t | a The output 
: a2 Kars 1oon impedance of the op amp 
| is artificially enlarged by 
Te oo lato R3 to provide frequency 
08g a {a kat = compensation (together 
with Cl). 
*Osup 3 ; There is a 
fundamental question to 
every peak detector: How 
Fig. 16-10: Peak detector with Differential NPN long should the voltage 
Paringiom pal stay on the capacitor? If 
the answer is "forever" 
then the detector displays the highest peak for an infinite history of input 
signals, which is probably not what you want. For any other answer you 
have two choices: either discharge C1 slowly, so the voltage stays within 
the desired accuracy over the time of interest, or reset C1 before each 
measurement. 
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In our circuit the capacitor is discharged by the base current of Q4. 
This current amounts to about 5nA. If the peak voltage is 1 Volt, you lose 
about 1% in 200msec with 100nF of capacitance. The discharge current 
varies from chip to chip and with temperature. 

Figure 16-11 shows a peak detector which operates from a single 
supply voltage. The Darlington input pair of the op-amp is more elaborate 
and the operating 
currents of the 


outer transistors \S - | H \S M 
(Q1, Q6) are si 
merely the base | pos ah | t—Karo 
currents of the 
inner ones. This we 
limits the fo ee OT a6 #|_|Peak 
temperature range -— 
(to about 100°C) - T 
but lowers the = on lide at ee 
input current. tosue a 
Notice the 


discharge resistor 
R1. Without it Fig. 16-11: Single-supply peak detector with a lower input 
> current. 


" = = +5V 


the base current of 
Q6 would charge C1 (it flows out of the base). Thus, in this circuit, a 
controlled rate of discharge through R1 is essential. 
P ; ee In both examples the supply 
2 Q | voltage must be lower than the 
10u M6 emitter-base breakdown voltage of 
"| +— the output transistor. If it is to be 
wor == higher, use an additional diode in 
RI series with the emitter. 
i ai Me CMOS devices are much 
* +_]reak better suited for peak detector 
vay W100 4g design than bipolar ones for two 
| 1 reasons: 1) there is no (DC) input 
[w w-8d ~ | 1007 current and 2) you can reset a 
| M=2 capacitor to zero volts (the 

Le collector-emitter voltage of bipolar 
== transistors does not go to zero, there 

is always a remaining voltage of 
Fig. 16-12: CMOS peak detector. about 100 or 150mV). 

As in the bipolar examples, 
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the feedback loop is compensated with a resistor in the output path, working 
together with Cext. 

Since there is no input current, C1 can be made quite small, to the 
point where it can be internal. But be aware that the smaller C1 the more 
difficult it becomes to compensate the feedback loop. 


Rectifiers and Averaging Circuits 


Fig. 16-13: Standard op-amp half- 


wave rectifier. 


abrupt impedance change 
around zero signal level can 
easily cause spikes and damped 
oscillation, affecting the 
accuracy. Specifically 
designed circuits avoid this - 
and require only a single 
supply voltage. 


In/V 


Out/V 


Figure 16-13 shows the standard 
configuration for a half-wave rectifier, 
appearing without much comment in 
dozens of text-books, usually without 
mentioning that it does not work well with 
many op-amps. 

Putting a diode in the feedback 
path is awfully hard on the op-amp. The 


+5V 


Q5 


2k 


TOsus 


Fig. 16-14: Bipolar half-wave rectifier. 


In the circuit of figure 16-14 


the output is at ground (without an 


0 02 04 06 08 1 1.2 14 #16 18 


Time/mSecs 


Fig. 16-15: Input and output waveforms. 
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200ySecs/div 


input signal), held there by R1. If the 
input moves above ground, the output 
follows. But if the input goes 
negative, there is nothing in the output 
stage that can pull it below ground, so 
it just stays there. 


2 
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The value of RI must be low enough to keep the voltage drop due to 
the base current of Q6 low. This resistor cannot be replaced with a current 
sink; the minimum collector-emitter voltage of an NPN transistor is too 
high. 

You can capacitively couple the input signal, with a resistor 
connected from the input terminal to ground to provide a dc path. 

Minimum required supply voltage for a 1-Volt input range is 3.5V. 

In figure 16-16 . . 
an inverting op-amp a a 
configuration with a 
gain of | is used, but it att 
only works for T SG 
negative-going input . 
signals. As the signal is", Gas 
moves above ground, aN Zt a L ea 
the op-amp is mS y, 
effectively disabled. 28k C1 
Thus the output simply I il Kor 
follows the input. To Tosus 
avoid loading down the od ibe iC 
output, a buffer needs to 
be used. As it happens, 


the circuit in figure 16- Fig. 16-16: A full-wave rectifier. 
14 is an excellent 


candidate for this job. 


Q10 gives a small 
. operating current to Q1 and Q4 
ue (about 1.7uA). Without this, 
= 4 frequency compensation of the 
a op-amp becomes very difficult. 
4 Minimum supply voltage 
is for a 1Vp input is 2 Volts. 
3 . Both of these circuits can 
08 be readily translated into 
. CMOS. The half-wave rectifier 


0 02 #04 06 08 1 12 414 #16 1.8 2 . . 
in figure 16-18 uses two 


Fig. 16-17: Input and output waveforms for the advantages of CMOS: first, 

full-wave rectifier. there is no base current, so the 
output pull-down impedance can be quite high; second, a current sink can 
be used at the output instead of a resistor. 
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With a 1-Volt input range this circuit works down to 1.8V supply at 
-40°C, 1.6V at 0°C. 

The full-wave rectifier of figure 16-19 only needs a 1-Volt supply 
for the same input range. 


= = = +3.3V 
\8? on 
10u 10u 
M6 
1 
ct | 
—— W=10u 
M1 7 M2 ig |e 
i iii] R1 
7 7 
| | 20k T 
w=10l W=10u 
L=0.35u —_b0.35u M5} Out 
@ 7 | 
M3 M4 R2 Weed. 
Lad 20k L=0.35u 
| | M=2 
We5u W=5 
LLO.35u L=0.35u 
AC 


Fig. 16-19: Single-supply CMOS full-wave rectifier. 
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Averaging the obtained rectified fundamentally takes time; the longer the 


70 


Full-Wave 
Rectifier 


mV 


Half-Wave 
Rectifier 


0 20 40 60 80 100 


Time/mSecs 20mSecs/div 


Fig. 16-20: Time constant and ripple for a 
one-pole low-pass filter used for averaging. 


Thermometers 


time constant, the smaller 
the ripple (but there will 
always be a ripple). 

Take the most 
simple approach, a single 
low-pass filter (RC) 
connected to the output. A 
full-wave output has the 
advantage, producing less 
ripple. The time constant 
used here is 10msec and 
the signal is 1kHz. 

To reduce the ripple 
you can increase the time 
constant (which takes 
longer to reach the final 
level), or use a higher- 
order filter. 


The PTAT (proportional to absolute temperature) current source has 


come up before (see page 5-4) . It is unusual 
and remarkable that we are able to produce a 
voltage whose value is directly tied to the 
absolute temperature scale and whose accuracy 
depends only on ratios, not affected by any 
process parameter. 

Figure 16-21 shows such a circuit. Q1 
through Q6 form a loop, started up by leakage 
alone (see page 5-6). For safety (in case the 
models are not quite correct) a substantial 
junction (D1) can be added, which has more 
leakage current than any of the other devices. 

The current in the loop is determined by 
the emitter ratios of Q2 to QI and R1. Recall the 
formula on page 5-3: 


Edition February 2005 16-10 


2.5V 


R5 
15k 
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Fig. 16-21: Thermometer 
with Kelvin scale. 
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* * 
deltaVBE = ad on 
q A2* 11 


I] and 12 are identical (as produced by Q5, Q6, R3 and R4) and the 
area ratio is 24. Thus deltaVBE amounts to roughly 83mV at 300K. Since 
T is in Kelvin, the current increases linearly from zero at absolute zero to 
22uA at 300K. This current is then mirrored by Q7 and causes a voltage 
drop across R2. With the values chosen (and perfect matching), the output 
voltage amounts to 1mV per Kelvin. Note that any temperature coefficient 
or absolute variation in R1 is eliminated by a matching R2. 

Although the design is relatively insensitive to power supply 
variation, accuracy is maximized by powering the thermometer from a 
reference voltage. 

Matching is the all-important factor here. A +1% resistor matching 
variation will result in an error of +3°C at room temperature. Adding 
mismatching of VBE and hFE, you must expect a variation of up to +5°C 
untrimmed. With trimming an accuracy of +0.5°C is possible. 


2.5V[_}* . . The centigrade (or Celsius) scale is 
see a the same as the Kelvin one, except 
as for the zero set to 273.16K. So, all 
IS a. we need to do is to create an offset 
voltage of 273.16mV and read the 
D1 1mV/K . . 
t temperature differentially. 
= (Similarly we can create a 
- a4 | *="*"" Fahrenheit scale by increasing R2 by 
—* a factor of 1.8 and setting the offset 
OK tne Lr to 459.67mV). 
I Pe ase Trimming is straightforward: 
date alah R2 sets the slope; trim it to read 


293.16mV at 20°C at the upper 
= = . 7 om, . . 
Fig 16-22: Athermomieterwitha terminal; R6 sets OC; trim it to read 


Celsius scale. 273.16mV at the lower terminal, also 
at 20°C. 


As we have seen in chapter 7, substrate PNP transistors can be used 
in CMOS to create a delta- VBE. Figure 16-23 uses this approach to 
produce a Kelvin output. The current mirror is that of figure 3-25. 

The untrimmed accuracy is somewhat worse in CMOS because of 
the poorer matching of the transistors (for the same area), about + 7°C at 
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Ma MS M7 
L ia <a <a 
am in 
W=20u w=20h w=20l 
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M4 M6 MB 
Ee HK! b<! 
= | ; 
| | 
20u W=20 W=20 
L=2u L=2u L=2u 
M=5 M=5 M=5 
L__y 
R3 
10k 
1mViK 
+ 7 
—— = 
M1 M2 
| | | 
| Mee 
Vi-20u Ww=20 
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—p1 
a RI R2 
5k 25k 
yar a2 \I 
1 10 
Lf | 


Fig. 16-23: CMOS Thermometer. 
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room temperature. You can improve this 
by using a larger emitter ratio for Q2/Q1 
and generally larger devices. 


But let's not forget the lowly diode. Its 
forward voltage has a predictable 
temperature dependence (about -2mV/°C). 
The slope is subject to absolute variation 
and not quite as linear as that of a delta- 
VBE, but the device is nevertheless useful 
in some applications. For example, if you 
want to evaluate the temperature at a 
particular spot on an IC (say next to a 
power device), use a 
diode-connected a 
transistor and connect 

it to a small probing K cK 
pad. You can then = = 
calibrate it first 


: : Fig. 16-24: 
without powering up Diode 
the chip. thermometers 


Zero-Crossing Detectors 


AC 


Suppose you need to start a timer or counter at the exact moment 
when the line voltage crosses the zero line. How do you determine this 
point without bringing the line voltage into the IC? A simple external 
resistor will do the trick. 


a Vec In a bipolar 


design you can use the 


(usually unpleasant) 
Pulse fact that a transistor 


} pair used in the first 
Bo K y Cy aio Stage can cut off the 


200m = transistor following in 


ce the second stage. In 
figure 16-25 one input 
of a differential pair is 


Fig. 16-25: Bipolar zero-crossing detector. 
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200mV (derived with a voltage divider from either a reference voltage or 
from the supply). When the other input is above 200mV, Q9 is turned off 
and the output is low. As you lower the input voltage, the output goes high 
at 200mV, when Q5 and Q9 turn on; but as the input drops below ground, 
Q5 cuts off Q9 and the output drops again. In other words: there is a small 
window at ground level where the output goes high, otherwise it is always 
low. 

Q1 clamps the positive-going AC voltage so it can do no damage to 
the IC. The negative-going waveform is automatically clamped by base- 
substrate diode of Q5. The power dissipation in the external resistor is 
25mW. 

Be prepared for a surprise 
when you simulate such a circuit: 2 
at first you can't see the output 
pulse, because it is very small 2. 
and short compared to the AC 
waveform. > 4, 

With 110V and Rext = P 
470kQ, the pulse is about Susec . 
wide. You get the same width at 
220V if you double the value of 


8.24 8.26 8.28 83 8.32 8.34 8.36 8.38 84 8.42 
Rext. 


Time/mSecs 20uSecs/div 


The circuit works with a 


supply as low as 1.2 Volts. 
Fig. 16-26: AC slope and output pulse. 


The effect used to create the window does not work well in CMOS. 
Instead we can employ two comparators, one biased at about +200mV (M9, 
M10) and the other at ground (figure 16-27). Their outputs are then 
supplied to an "and" gate (M17, M18), which drives the output. 

The AC waveform is clamped in the positive direction by two 
"diode-connected" transistors (M1, M2) and in the negative direction by a 
substrate diode. 

Since there is no base current, you could theoretically make Rext. 
very large. But also consider that the devices connected to the input 
(including the pad and the ESD protection device) have a small amount of 
capacitance. The time constant formed by the external resistor and this 
capacitance must be smaller than the desired pulse-width. 
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ah at = i = a 1.8V 
M3 M4 MB M12 M14 M16 
| | | | | 
stu | Wet w=td wal w=td Wet 
tko.18u | L=0.1gu L=0.18u L=0.1du | L=0.18u Leorgu = =M19) 
= = | 
W=2u 
wo | | Mio i265 
eal | 
lol W=10u Ma M20] Pulse 
In - L=0.14u to.18u lg | 
AC Rext p— ‘* ' | | aH 
a m oe | & | a 200m ! W=1u 
iMeg | | wea] L=0.51) 
ai W=10u L=0.1du 
Mi - L=0.18u tko.18u ai Mig M=2 
Zs | 
Ss 
w=10l 10u We2u w-2f M18 
L=5u \s" LL0.18u L=0.1du ih | 
| M7 ee M15 at 
M2 | L=0.1gu 
| ee 
=2u W=2u 
w=10 tLo.t8u L=0.18u 
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Fig. 16-27: CMOS version of zero-crossing detector. 
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17 Layout 


The layout of analog ICs has so far remained an art, there are no 
computer programs which could design, place and route the components in 
an intelligent, competent way. And, more often than not, the person who 
created the circuit diagram needs to (or should) get involved. 

This chapter is by no means a complete guide; it would take an 
entire book to do the subject justice. Look at it as some hints stemming 
from practical experience. 


Bipolar Transistors 


The minimum sequence of masks in 
a bipolar process 1s: 
Buried Layer 
Isolation 
Sinker 
Base 

Contacts Emitter (N+) 

Contact 

Pad 
The last mask opens up windows 
Buried Layer Sinker over the bonding pads in a thick glass layer 
which is spread over the entire circuit to 
protect the delicate metal. 

Note that the emitter (N+) mask is 
also used to make a low-resistance contact 
to the collector (the epitaxial layer). 

To these seven basic masks several other may be added: 

A second (often identical) isolation mask, applied after the buried 
layer (but before epitaxial growth), to implant p-type regions which diffuse 
upward (up-down isolation). 

A separate mask for high-value (implanted) resistors. 


Isolation 


Fig. 17-1: Mask layers for an 
NPN transistor. 
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A mask for Schottky diodes, which can consist either of Aluminum 
or barrier metals directly in contact with the epi layer. 

An additional mask for P+ regions, sometimes used to improve the 
performance of lateral PNP transistors. 

A mask for thin-film resistors. 

Occasionally a washed emitter is used. Here the emitter diffusion 
(or more likely, implant) takes place through the N+ contact openings 
(while the windows to the P-regions are masked off). After creating the N+ 
regions, the thin oxide layer over them is simply etched (or washed) off 
without a mask. In this way the emitter area can be made smaller, because 
it is self-aligned with the contact window. 

The dimensions of the mask patterns are determined by the process; 
some of the factors are: 

Minimum size of contacts; determined by how small a window can 
be etched into the oxide. Most of the small-geometry processes require all 
contacts to be of identical size. 

Distance between emitter and base contacts; usually given by the 
minimum required spacing of the metal covering them. 

Overlap of metal over contacts; determined by how well the metal 
can be aligned to the contact. 

Spacing between sinker and base; set by the sideways diffusion of 
the sinker (and base) and the depletion layer width for the maximum 
voltage. 

Spacing between base and isolation; determined by the sideways 
diffusion of the isolation (and base) and two depletion regions. 

Spacing between sinker and isolation; must accommodate two 
sideways diffusions and one depletion region. 

Spacing between buried layer and isolation; must allow for the 
sideways diffusions of both the isolation and the 
buried layer. After epitaxial growth the image of 
the buried layer at the surface is blurred and shifted 
along the crystal axis (see page 1-17) and thus 
results in the least accurate alignment. 

The isolation mask is a special case. The 
diffusion takes place between the devices, but it 
would be awkward to draw such a complex web. 
Thus a convention has been established to draw the 
isolation region where it is not, and then invert the 
pattern on the mask. 

There are several choices for the design of Fig. 17-2: Isolation 
an NPN transistor; figure 17-3 shows some of them. pattern. 
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In the top pattern the emitter is in the center, the base 
contact on the right and the collector contact on the 
left. In the pattern below, emitter and base contact 
are reversed. There is a slight advantage to having 
the emitter closer to the collector contact, in that the 
distance the current has to travel in the buried layer 
to a point underneath the emitter is reduced. (For 
clarity, the buried layer and sinker patterns have 
been omitted). 

You will also see NPN transistor patterns 
with more space between emitter and base or base 
and collector to accommodate metal lines. 

In the third pattern the collector contact has 
been moved, resulting in a somewhat lower 
saturation voltage. 

The bottom pattern contains two base 


ee contacts, effectively doubling 
the current capability of the 

. . devices (remember that the 
= = maximum current is given by 

. _ the effective emitter length, 
CSE i.e. the periphery of the emitter 


facing the base contact; the 
rest of the emitter area is 
ineffective at high currents). 

Figure 17-4 shows NPN 
transistors with two emitters in 
a single island (or tub). In the top pattern collector 
and base are common, i.e. connected together, which 
limits the usefulness of the device. 

In the center pattern there are separate bases, 
only the collector is common. The bottom pattern is 
identical, with the contacts redrawn for uniform size. 

There is a danger with multiple emitters in the 
same island. As mentioned before, the image of the 
buried layer is shifted (in all but low-pressure 
epitaxial processes). The actual buried layer region is 
where it is supposed to be, a rectangle covering the 
area from the collector contact (and sinker) to the far 


Fig. 17-3: Small NPN 
transistor patterns. 


Fig. 17-4: Two- 
- . : emitter NPN 
edges of the base regions. But the image appearing transistors. 


on the surface is shifted (along the crystal axis). In 
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<111> silicon starting material, with the wafer- 
flat at the bottom, the epi-shift is to the right 
(figure 17-5). The amount of shift is roughly 
equal to the thickness of the epi-layer. What we 
see on the surface is a depression, caused by a 
slight consumption of silicon during the 
diffusion of the buried layer. Thus it is likely 
that this step in surface height will hit the left 
emitter, but not the right one, which influences 
their matching. 


This effect can be avoided if the entire Fig. 17-5: Epi-shift 
influencing one emitter 


but not the other. 


Wroay cay, 
LOIS 


SS ; transistor is rotated so the 
| edge of the shifted 
a 2) pattern falls between the 
collector contact and the 
SSSA bases. Figure 17-6 
i Ee eon shows this with a two- 
transistor layout. The 
center transistor has a 
single emitter, the outer 
one 16 (used, for 
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Fig. 17-6: 16:1 emitter ratio with epi-shift mismatch example, in a bandgap 
avoided. reference). 
The transistor patterns EE 


setetetetetenstst 
See 
seecesesetesetets 


shown so far are all intended 
for smallest possible size, 
which naturally limits how RS) poe 
much current they can carry. 

To increase the current 
capability, the effective 
emitter length needs to be 
increased. For the NPN 
transistor in figure 17-7 not 
only have the emitters been 
tripled and stretched, but base 
contacts have been placed on 
both long sides. Note that the 
increase in current capability Fig. 17-7: NPN transistor for higher current. 
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almost always requires wider metal runs for both the emitter and collector 
(see box on page 14-5). 


An alternate design is 
fe shown in figure 17-8, with 


Satara 
ss 
eS 


the uniform contact openings 
required by dense processes. 
There are two collector 
contacts (on the outside), 
three emitters and four base 
contact columns. 

If you increase the 
size of such a transistor 
further, there comes a point 
where it is of advantage to 


senor ease 
Sseiee SER, 


S 


saeatecene 
seseitacseeest 


5 
9 


see 
2x 
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9 
states 


sear 


ast 
se 
sefone 


SS 
ROO 


irene 
RRM 


2? 


ae taper both the emitter and 
ee ce 8 collector metal, er adually 
increasing the width as the 
Fig. 17-8: Alternate high-current design. currents from more and more 


contacts are added. 


Lateral PNP Transistors 


The emitter of a lateral PNP 
transistor (figure 17-9) is in the center, the 
dark contact in a p-type (NPN base) 
diffusion. It is surrounded by the collector, 
another p-type region. The distance 
between the outer edge of the emitter and 
the inner edge of the collector is the base- 
width. Since both of these regions are on 
the same mask, emitter and collector are 
self-aligning and the base-width tends to be 
very accurate. It needs to be large enough 
to accommodate the two sideways 
diffusions and the depletion region 
spreading from the collector toward the 
emitter. 

By extending the emitter metal so 
that it covers the entire base, a field plate 
is created (always connected to the emitter, which has the highest positive 


EE 


Fig. 17-9: Lateral PNP transistors. 
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voltage). This field plate improves the gain of the transistor at low current 
by keeping p-type charges away from the surface. (In a CMOS process, the 
poly layer is used as a field plate, also connected to the emitter). 

Although a circular emitter results in a uniform base-width and thus 
(theoretically) produces the highest possible gain, there is actually very little 
enhancement over the more simple square one. 

The third terminal (at the bottom) is the base contact, identical to a 
collector contact for an NPN transistor. 

For a lateral PNP transistor (in a bipolar process) the presence of a 
buried layer is essential. Without it, the substrate (connected to the most 
negative supply) would be just as attractive a collector as the intended one; 
i.e. about half the emitter current would flow to the collector, the other half 
to the substrate. 

The dual pattern at the bottom of figure 17-9 should be avoided. It 
looks attractive, especially since lateral PNP transistors often have common 
bases (e.g. in current mirrors), but the two devices influence each other, 
especially in saturation. 


Resistors 


In the case of a diffused resistor there 
is always a surrounding semiconductor 
region (the epitaxial layer for a bipolar 
process, a well or the substrate in CMOS). 
The surrounding region needs to be at a 
potential so that the junction is reverse- 
biased. This bias voltage causes a depletion 
layer to extend into the resistor, decreasing 
its cross-section and thus increasing its 
resistance. For a base diffusion this effect is 
small but occasionally not negligible (about 
1%); for an implanted resistor it can be very 
large (20%). 

Thus, if two diffused resistors form a 
Fig. 17-10: Diffused resistors  VOltage divider, the difference between the 

in a common (top) and in bias voltage and the resistor voltage is larger 
eaparale (peltom) ilands. for the lower resistor than the upper one, 
resulting in a shift of the divider ratio. It may 
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be small enough to ignore for resistors with 200 Ohms/square (about 0.2%), 
but for implanted resistors this error is almost always significant. Note that 
this is an initial error only; it is not subject to change during production. 

To avoid this effect, you can place each resistor in its own tub and 


connect the tub to the positive end of the resistor. 


Routing the metal also needs 
some thought. The Seebeck coefficient 
(see page 1-31) creates a small voltage 
between junctions of the same material, 
located at different temperatures. For 
this reason it is of advantage to keep 
connections close together, so that a 
thermal gradient will have the smallest 
effect. Figure 17-11 shows a pair of 
resistors with three sections each, 


connected on the same side to obtain the Fig. 17-11: Intermingling and 


shortest distance. For optimum 
matching in the presence of a thermal 
gradient the sections also alternate. 


CMOS Transistors 


connection of matching resistors. 


With the basic layers only, the layout of 
an n-channel transistor is quite simple: there 
are only three patterns. The first pattern is the 
poly gate (sitting on top of thin oxide). The 
second pattern delineates an N+ implant; it is 
simply a rectangle, protruding on either side of 
the poly shape. The n-type dopants enter the p- 
type silicon underneath (the substrate) only 
outside the gate area; 


they are stopped by the 


WY . poly-silicon layer. The third mask places 
Ui . contact opening in the poly and implanted 
regions. 
iN For a p-channel device two additional 
Gi masks are required: one for an n-well 
(surrounding the device or several devices) and 
Fig. 17-12: Layout of n- one for the P+ implant. The n-well must be 


channel (top) and p- 


: contacted and biased. 
channel transistors. 
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The patterns in figure 17-12 show long channels, as often required 
in analog design; the channel length is from left to right, the channel width 
from top to bottom. 

Alas, if things only were that simple. In reality the layout of CMOS 
IC always involves a large number of masks. There are such layers as field 
implant, threshold implant, poly 2, poly 3, metal 2, metal 3, metal 4 (an on), 
interconnections between the metal layers (vias), the pad mask. 

Also, some layers are not drawn directly, but are coded. Additional 
layers are used which form mask patterns only in combination with others. 

Most CMOS processes 
have an n-well (i.e. n-channel 
devices sit in the common 
substrate, while p-channel 
transistors are in common or 
separate n-wells); sometimes 
both n and p-wells are present. 

Drain and source are 
interchangeable, which leads to 
a peculiar but efficient way to 
connect devices in parallel (i.e. 
to increase the channel width): 
you connect them in series and 
merge the terminals between 
the gates (figure 17-13). Thus the source of one transistor also acts as the 
drain of the next one, saving space. 


Fig. 17-13: Parallel connection of n-channel 
transistors. 


The smallest set of matching 
devices consists of four transistors, 
arranged to be point-symmetrical. 
The terminals of the two devices 
labeled A (figure 17-14) are 
connected together, as are those for 
the B transistors. You will find that 
this is almost impossible without 
employing the second metal layer. 

If there is a gradient 
(thermal or otherwise) it will affect 
both devices equally, no matter 
which direction it takes. 


Fig. 17-14: A matching pair of n-channel 
transistors, using four devices. 
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Matching: Myths and Misconceptions 


Over the years a number of rules have accumulated around 
analog design, especially concerning matching devices. For example, 
most designers believe that matching devices should be intermingled 
and as close together as possible, because the diffusions or implants 
have gradients, i.e. vary gradually in depth or concentration over the 
area of the chip. 

A few years ago I had an opportunity to examine this. I 
measured the matching of adjacent devices and compared that with 
devices which were farther apart. To my surprise I found no 
statistically valid difference in matching for a distance of up to 2mm. 

It seems that, perhaps, diffusion gradients were present in the 
early days but, with better furnaces and especially ion implantation, 
have disappeared to the point where they simply no longer play a role. 

To be sure, there are thermal gradients, created by devices 
which heat one area of a chip more than another. For this reason alone 
it is wise to intermingle devices and place them close together (and as 
far from the heat source as possible. 


A second belief divides matching devices into as many small 
pieces as possible, so that they benefit from the statistical effect (large 
groups of devices match better than two single ones). This has proved 
to be only marginally true. As you decrease the size of features, the 
percentage variation becomes greater. Thus, as you approach 
minimum geometry, matching actually becomes worse for the same 
overall area. 


The third belief holds that you should add dummy devices at 
the periphery. There appear to be two different explanations to justify 
this practice: 1. shadows or reflections during exposure act differently 
on the remote edges than on devices in close proximity, or: 2. the etch- 
rate for wide spaces is different from narrow ones. 

I found no difference between groups of resistors with and 
without dummy devices at the periphery. It appears that you might be 
better off using the extra space to make the devices larger. 
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Cross-Unders 


Bipolar analog ICs can often be interconnected using a single metal 
layer, which lowers the cost. But you will inevitably find spots where two 
metal lines need to cross. 

Using the diffused layer with the lowest resistance (emitter, or N+), 
one interconnection stops at a contact, dives 
under the second metal line, and continues at 
the second contact. This introduces a small 
amount of resistance (about 20 Ohms), which 
is tolerable in such places as a base of a 
transistor. 

In figure 17-15 the N+ rectangle is 
placed inside a base diffusion (which sits 
inside an epi-island). One side of the cross- 
under is connected to the base region (it does not matter which side, since 
the voltage drop across the cross-under is bound to be much smaller than 
that of a diode). If the epi-island is biased at the highest positive supply 
voltage, you can have several such cross-unders in the same island. 

The epitaxial region can serve as 
a cross-under as well, though it takes up 
more space. 

A special case in the latter 
approach is the NPN transistor with two 
collector contacts (figure 7-16). Here a 
line connected to the collector stops at 


the right-hand contact and continues on 
Fig. 17-16: Two collector contacts in at the left-hand one. 


an NPN transistors can act as a ; F 
cross-under. But be careful with this scheme. 


Let's assume the resistance between the 
contacts is 100 Ohms. The resistance between one contact and the center 
(i.e. a point underneath the emitter) is then about 50 Ohms. If one contact 
carries all or most of the collector current, the second contact will display 
the voltage at the center and not that of the first contact. 


Fig. 17-15: N+ cross-under. 
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Kelvin Connections 


Any contact on the surface of an IC a 
has some resistance, which often makes a . 
precision measurements difficult. This 
can be avoided by providing two sets of 
contacts, one to carry the current, the other 
to measure the voltage. 

In a Kelvin connection contact 
resistance is of no consequence. The two 


resistances in the current path add a bit to Big Ae Ninconne enon 
: ' for a resistor and its equivalent 
the headroom required, the two in circuit 


measurement path simply need to be 
negligible compared to the measuring impedance. 


Metal Runs and Ground Connections 


The concept of an "analog ground" is often misunderstood. It is 
meant to be a noise-free point (or hub), a spot either on the circuit board or 
on the IC which can be used as a 0-Volt reference. 

The usual practice designates a pin which carries little or no current 
as the analog ground; other pins, intended to be at the same potential but 
carrying current are then connected to this point on the circuit board. 

There is another way to achieve this, one which saves a pin and has 
better performance. A package pin has low resistance, lower than a trace on 
a circuit board or a metal run on the IC. Designate a pin as the analog 
ground and then connect not one but two 
neighboring pads to it with separate bonding 
wires: one carries no current and serves as the 
analog reference ground on the IC, the other 
carries the potentially polluting currents. 

Similarly, on the IC, use separate metal 
runs to connect sensitive devices. In figure 17-18 
the left-hand connection can create an error. 
Assume the runs lead to emitters carrying 1mA. 
With say 50 squares (at 30mOhms per square) of 
additional aluminum for the upper device, the 

voltage drop is 1.5mV, creating a current 
Rigs 16 rep mismatch of 6% at room temperature. With the 
connection (on the right) ; ; a : 
for matching devices. balanced connection on the right this is avoided. 
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Back-Lapping and Gold-Plating 


To fit into small, shallow packages, wafers are often thinned down 
by back-lapping (a somewhat messy, wet grinding operation). This 
removes not only the oxide layer on the back but any diffusions which may 
have taken place there, giving direct access to the substrate material. If you 
add a gold-plating step you get a low-resistance connection directly to the 
substrate. 

Ordinarily such a connection is not essential (the substrate is also 
contacted from the top). But if you have sinned and allowed high substrate 
currents, you may be able to suppress the resulting effects in this way. 


DRC and LVS 


A computer is clearly not as smart as a human being (or so we like 
to think), but it is adamantly intolerant of errors and it never tires. 

There are two checking operations required when a layout is 
finished. The first is Design Rule Checking (DRC), where it is made sure 
that the dimensions and spacings in each layer in each device and for all the 
connections obey the design rules. 

The second compares the layout with the (simulated) schematic 
(layout versus schematic, or LVS). 

One has to face the humiliating fact that the human being is not well 
suited for either job. Great attention to an excruciating amount of detail is 
required, which we cannot handle without making mistakes. 

Only if DRC and LVS are done by computer can you be certain that 
the chip will contain what you think it contains. 


Thus ends this book of the minority field in the world of semiconductors. A 
field past glamour, often neglected, but undeniably essential. And a field of 
great satisfaction for those who know it. 
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